
PCT 

INTERNATIONAL APPLI 



WORLD INTELLECTUAL PROPERTY ORGAN1ZA 
International Bureau 



ON PUBLISHED UNDER THE PA 



wTEN^^X 




£>OPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C07K 14/32, A61K 301/385, C12N 15/62, 
15/75 



A2 



(11) International Publication Number: 
(43) International Publication Date: 



WO 95/19371 

20 July 1995 (20.07.95) 



(21) International Application Number: PCT/EP95/00147 

(22) Internationa] Ftting Date: 13 January 1995 (13.01.95) 



(30) Priority Data: 
9400650.9 



14 January 1994 (14.01.94) GB 



(71) Applica nt (for all designated States except US): SOLVAY 

(SOCIFIE ANONYME) fBE/BE]; 33, rue du Prince- Albert, 
B-1050 Bruxelles (BE). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): DEBLAERE, Rolf, Y. 
[BE/BE]; Schoolstraat 24, B-9950 Waarschoot (BE). DE- 
SOMER, Jan [BE/BE]; Nieuwewandeling 58, B-9000 Gand 
(BE). DHAESE, Patrick [BE/BE]; Jean Fobelaan 8, B-9810 
Drongen (BE). 

(74) Agents: MEYERS, Liliane et ah; Solvay (Soci&e* Anonyme), 
Dept. de la Propria Industrielle, 310, rue de Ransbeek! 
B-1I20 Bruxelles (BE). 



(81) Designated States: JP, US, European patent (AT, BE, CH. DE 
DK, ES, FR, GB. GR, IE, IT, LU, MC, NL, PT. SE). 

Published 

Without international search report and to be republished 
upon receipt of that report. 



(54) Title: EXPRESSION OF SURFACE LAYER PROTEINS 
(57) Abstract 



AnnDeClercq. 

P «ent Office 



BEST AVAILABLE COPY 



FOR THE PURPOSES OF INFORMATION ONLY 

Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AT 


Austria 


GB 


United Kingdom 


AU 


Australia 


GE 


Georgia 


BB 


Barbados 


GN 


Guinea 


BE 


Belgium 


GR 


Greece 


BF 


Burkina Faso 


HU 


Hungary 


BG 


Bulgaria 


IE 


Ireland 


BJ 


Benin 


IT 


Italy 


BR 


Brazil 


JP 


Japan 


BY 


Belarus 


KE 


Kenya 


CA 


Canada 


KG 


Kyrgystan 


CF 


Central African Republic 


KP 


Democratic People's Republic 


CG 


Congo 




of Korea 


CH 


Switzerland 


KR 


Republic of Korea 


CI 


Cote d'lvoire 


KZ 


Kazakhstan 


CM 


Cameroon 


LI 


Liechtenstein 


CN 


China 


LK 


Sri Lanka 


cs 


Czechoslovakia 


LU 


Luxembourg 


cz 


Czech Republic 


LV 


Latvia 


DE 


Germany 


MC 


Monaco 


DK 


Denmark 


MD 


Republic of Moldova 


ES 


Spain 


MG 


Madagascar 


FI 


Finland 


ML 


Mali 


FR 


France 


MN 


Mongolia 


GA 


Gabon 







MR Mauritania 

MW Malawi 

NE Niger 

NL Netherlands 

NO Norway 

NZ New Zealand 

PL Poland 

PT Portugal 

RO Romania 

RU Russian Federation 

SD Sudan 

SE Sweden 

SI Slovenia 

SK Slovakia 

SN Senegal 

TD Chad 

TG Togo 

TJ Tajikistan 

TT Trinidad and Tobago 

l)A Ukraine 

US United Stales of America 

UZ Uzbekistan 

VN Viet Nam 



95/19371 




PCT7EP95/00147 



Expression of surface layer proteins 



The present invention relates to vaccines and proteins, rDNA molecules 
encoding protein expression and presentation systems for the production and 
presentation of the said proteins, expression vectors therefor, and hosts 
transformed therewith, as well as methods involved therewith. 

The traditional view on the native polymeric organization of the 
bacterial cell wall has changed dramatically over recent years with the 
development of new techniques for electron microscopic analysis. The 
classical idea that the cell membrane(s) is(are) covered by a peptide-glycan- 
containing matrix does not hold any longer. Besides additional surface 
structures such as capsules, sheaths, slimes or fimbriae, proteinaceous 
surface arrays or S-layers are being recognized as a main constituent of the 
bacterial cell wall (Sleytr and Messner, 1988). 

S-layers are a common feature in archaebacterial surfaces (Konig, 
1988). In some species such as Halobacterium salinarum or Thermoproteus 
spp. the proteinaceous S-layer even forms the sole cell wall. At present S- 
layers are being detected with increasing frequency in a large range of 
gram-positive and gram-negative eubacteria. Surface arrays are composed 
of protein or glycoprotein subunits that are arranged into a paracrystalline 
two-dimensional array, displaying hexagonal, tetragonal or oblique 
symmetry. Self-assembly of the S-layer is an inherent property of the 
subunit and is the result of non-covalent protein-protein interactions 
mediated through salt bridging by divalent metal cations (Mg 2 **- or Ca 2+ ). 
Non-covalent interactions with components of the underlying cell envelope 
are thought to be responsible for its positioning at the outermost surface. 

Despite the cloning and the characterization of several genes encoding 
S-layer proteins (SLP's), their function still remains speculative. A variety 
of functions have been attributed to surface arrays. They might serve as a 
protective barrier against degradative enzymes or predators, such as 
Bdellovibrio or help in maintaining bacterial cell shape and form. In some 
bacterial pathogens, S-layers have been identified as important virulence 
factors. Although S-layers have several physical features in common, 
general conclusions on their function cannot yet been drawn. 
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SLP . S are thus present in a .arge number of , 

continuously and spontaneously produced in larger am 

nX^sefand either presenter, or secreted by systems 
.Jfor ceus. The genes of these SLP s y s,em(s) * 

promoter sequence^ a signal peptide coding sequence whtch treated 
downstream of the promoter sequel), a SU> codtng sequent ^ 
^senption refutation sequence. The SLP codutg sequence s Ice red 
downstream from the signal peptide coding sequence, havmg : ts S- 

an SLP secretion system. In the former, the SLP s are Jjoun p 
„aU of a host where they are thus presented. In the .arte , the SLP s are 
^produced in the cy.op.asm (in.mceUu.ar production, or secreted tnto 
,he surrounding medium (extracellular secretton) _ 

The SLP expression and secretion systems of several bacteria 
we „«Jd. Among these are those SLP expression and secretton 
XmsTf bacerla of the genus tf.. ere weU-trtown as 

—on and secretion system of the 

speeds e4u^ »as ««» — siVe,y •"* d fOT " S ""T J 
^rassmTandlx^eUuiarly secreting .arge quanrities o, preterm nred 

ZZT B. brevis is able to secrete large amounts of extraceUu.ar SLP 
^ re^dlTaid translocation of the predetermined p~ *ss t 
brevis-s unique two-layer cell waU for extracellular secretton thereof . Mso, 
f^vi, does no. secrete extraceUular proteases i, quanm.es whtch may 
degrade and inactivate the extraceUularly-produced protems. 

8 Tsukagoshi B „ 0985) disdoses the rasion of the ^-amylase £ e of 
^^j^^ oV-5 ,0 the SLP codnog gene of 1^ 
— ' — — ^To^yUse m ajjearastarmapta^ DV-5SJ« 
w ittWl HB.O. and ^m^^ >A289 hosts that are 
l^d^Th. Comparison studies showed that .he , BJ»*B 
ZZn levels were one hundred (.00) times higher than tha, of E. 
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stearothenmophilius itself. B. brevis secretion levels were fifteen (15) times 
higher than those of E. coli and five (5) times higher than those of 
subtms. The efficient secretion of the enzyme in B. brevis is suggested 
therein as being due to the unique properties, of the cell wall of the B. 
brevis . 

Yamagata et al (1987) discloses the translational fusion of the 5 '-region 
of the gene coding for the middle walJ protein (a SLP particular to B, 
brevis) of B. brevis 47 with the a-amylase gene of Bacillus licheniformis 
for expression in B. brevis 47. The translational fusion of these genes is 
reported as achieving efficient levels of a-amylase production in B. brevis 
47. 

Tsukagoshi (1987/8) discloses the translational fusion of the gene 
coding for swine pepsinogen with the 5 '-region of the middle wall protein 
gene of B. brevis for expression in B. brevis 47 and B. brevis HPD3 1 . f 
Translational fusion of the 5 '-region to the CGTase gene of Bacillus 
macerans also resulted in the expression of the efficient levels of CGTase in 
B. brevis 47. 

EP-A-0257189 in the name of Higeta Shoyu Co., Ltd., et aj., discloses 
a series of B. brevis strains which may be utilised as hosts to produce large 
amounts of proteins without producing deleterious amounts of extracellular 
proteases. 

GB-A-2 182664 in the name of Udaka discloses "expressing genes" that 
are derived from B. brevis 47 and which may be fused to genes coding for 
heterologous proteins. Among the heterologous genes suggested as being 
appropriate for being fused to the genes derived from B. brevis 47 are 
various eucaryotic genes (such as those genes coding for interferon and 
insulin) as well as procaryotic genes (such as those genes coding for 
tryptophanase and aspartate ammonia lyase). The fused genes may then be 
incorporated into expression vectors for transforming B. brevis 47. 

Adachi et al (1989) discloses the fusion of the co-transcriptional cell 
wall protein (cwrj) gene operon (coding for both the middle wall protein and 
the outer wall protein) of B. brevis 47 with the gene coding for a-amylase 
in B. licheniformis in order to provide extracellular production of B 
licheniformis a-amylase by B. brevis 47 and B. subtilis 1A289. The 
presence of several different cwp operon transcripts and the presence of at 
least three different promoters (referred as therein as the PI, P2 and P3 
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promoters) were confirmed. It was reported that the PI and P3 promoters 
were used in the same extent in B. brevis and B. subtilis . whereas the P2 
promoter was reported to be used much less frequently in B. subtilis than in 
B. brevis . 

5 Takao et al (1989) discloses an expression-secretion vector for 

transforming B. brevis hosts for producing heterologous proteins, including 
eucaryotic proteins, such as swine pepinsogen. The vector utilizes the 
promoter, the signal-peptide coding sequences and nine (9) amino-terminal 
amino acids of a middle wall protein of B. brevis which are fused to a 

10 heterologous protein coding sequence. The hosts transformed thereby are 
B. brevis 47 and HPD31. 

Yamagata et al (1989) discloses a host-vector system utilizing strains 
(47 and HPD31) of B. brevis that hyperproduce SLP's as the hosts. 
Expression-secretion vectors are constructed from multiple promoters, the 

15 peptide-signal coding region and a structural gene for one of the major cell 
wall proteins of B. brevis 47. The B. brevis 47 genes were fused to a 
synthetic gene coding for human epidermal growth factor (hEGF). 

In addition to the use of SLP expression and secretion systems derived 
from B. brevis in B. brevis . it has also been disclosed to utilize SLP 

20 expression and secretion systems of B. brevis in B. subtilis hosts. Tsuboi et 
al (1989) discloses the transformation of B. subtilis with genes from B. 
brevis 47 that code for middle wall proteins. The transformed B. subtilis is 
thus capable of expressing the middle wall protein of B. brevis . 

It has also been disclosed by Tang et al (1989) that the SLP expression 

25 and secretion system of an alkaline phosphatase secretion-deficient mutant 
strain (strain NM 105) of B. Ucheniformis 749/C can be cloned into mutant 
strains of both E. coli (strain NM 539) and B. subtilis (strain Mil 12). 
Bowditch et al 1989 discloses cloning the gene coding for the SLP of 
B.sphaerius into E. coli TBI, JM101 and JM107. The transformed E. coli 

30 hosts then expressed the B. sphaericus SLP. 

Lucas et al (1994) . while studying the S-layer protein of Acetogenium 
kivui, disclose that there exists a repeated peptide sequence at the N- 
terminus of said S-layer protein which is shared by several different S-layer 
proteins, such as the middle wall protein from B. brevis and the S-layer 

35 protein form B. sphaericus 2362, and these authors suggest that this 
conserved domain is essential to anchor these S-layer proteins to the 
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underlying peptidoglucan. Interestingly, Matuschek et al. (1994) disclose 
that the same conserved domain, which is found at the N-terminus of the 
Acetogenium kivui S-layer, is also present in the sequence of the 
extracellular, cell-bound pullulanase from Thermoanaerobacterium 
5 thermosulfurigenes, but in the latter case it is located near the C-terminus of 
the polypeptide. 

US-A-5043158 discloses pharmaceutical compositions which comprise 
carriers that are chemically-coupled to epitope-bearing moieties. The 
carriers are isolated crystalline or paracrystalline glycoproteins, especially 

10 those derived from the SLP's of Clostridium thermohvdrosulfuricum and JL 
stearothe rmophilus . The conjugates formed thereby were reported as being 
capable of eliciting the formation of antibodies as well as eliciting B-cell 
mediated and T-cell mediated responses. 

It is a primary objective of the present invention to provide a 

15 recombinant DNA (rDNA) molecule that includes a SLP system capable of y 
expressing and presenting, rather than expressing and secreting, a fusion 
polypeptide (such as a fused SLP/antigenic peptide) in a wide variety of 
bacteria including bacteria of the genus Bacillus and, more particularly, IL 
sohaericus. 

20 It is yet another primary object of the present invention to provide such v 

a rDNA molecule which includes, derived from B. sphaericus . SLP 
promoter sequence(s), a SLP signal-peptide coding sequence and a SLP 
coding sequence which codes for at least a functional portion of the surface 
layer protein of B. sphaericus and which may be fused to a heterologous 

25 coding sequence coding for a heterologous polypeptide (such as an antigenic 
peptide), such that the expression of the heterologous polypeptide is placed 
under the control of the said promoter(s) and further such that the 
heterologous polypeptide expressed thereby will be fused to the SLP so as 
to be bound-up in the cell wall of the host for presentation thereof on the 

30 outer surface of the host's cell wall for eliciting an immunogenic response 
thereto. 

It is a yet further primary object of the present invention to provide 
vectors containing such rDNA molecules, which vectors may be used to 
effectively transform host cells. 
35 It is a still yet further primary object of the present invention to provide 

hosts, especially hosts of the genus Bacillus , and more particularly of the 
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species B. sphaericus , which are transformed with vectors containing such 
rDNA molecules, which express fusion polypeptides (such as antigenic 
peptides) produced thereby and which present the expressed fusion 
polypeptides for, inter alia, eliciting an immunogenic response thereto. 
5 A still further primary object of the present invention is to provide 

methods for forming the rDNA molecules, for preparing the appropriate 
vectors therefor, for transforming hosts with such vectors and for producing 
the fusion peptides (vaccines and proteins) of the present invention. 

The present invention provides a host cell which is provided with a S- 
10 layer comprising a fusion polypeptide consisting essentially of: 

(a) at least sufficient of a SLP for a S-layer composed thereof to assemble, 
and 

(b) a heterologous polypeptide which is fused to either the carboxy 
terminus of (a) or the amino terminus of (a) and which is thereby 

15 presented on the outer surface of the said cell. 

Preferably, the heterologous polypeptide is fused to the carboxy 
terminus of (a). 

The amount of a SLP which is sufficient in its own right for a S-layer 
composed thereof to form is termed herein the "functional portion" of the 

20 SLP. The fusion polypeptide thus typically incorporates at least the 

functional portion of a SLP native to the host cell. Sacculi derived from a 
host cell according to the invention also form part of the invention. 

The heterologous polypeptide may be an antigenic peptide. In that 
event, the invention provides a vaccine comprising a host cell or sacculi 

25 according to the invention wherein the heterologous polypeptide is an 

antigenic peptide and a pharmaceutically or veterinarily acceptable carrier or 
diluent. 

The invention further provides a recombinant DNA molecule which 
comprises a promoter operably linked to a coding sequence which encodes a 
30 signal peptide and a fusion polypeptide, the signal peptide being capable of 
directing the said fusion polypeptide to be presented on the surface of a host 
cell in which expression occurs and the fusion polypeptide consisting 
essentially of a heterologous polypeptide fused to either the carboxy 
terminus or to the amino terminus of at least sufficient of a SLP for a S- 
35 layer composed thereof to assemble. 

An efficient and reliable system which employs an SLP expression and 
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presentation system is therefore provided for the expression and presentation 
of fusion polypeptides (such as antigenic peptides for vaccines) in a wide 
variety of, preferably, Bacilli . This system includes a recombinant DNA 
molecule having a promoter that is ftised to a functional DNA sequence, so 
that the functional DNA sequencers placed under the control of the 
promoter. The functional DNA sequence includes a SLP coding sequence 
which codes for at least a functional portion of a SLP. The functional DNA 
sequence further includes a heterologous polypeptide coding sequence that 
codes for a heterologous polypeptide (such as an antigenic peptide for use as 
a vaccine or a protein) which peptide coding sequence is fused to the SLP 
coding sequence. 

As used herein, the term "functional DNA sequence" refers to DNA 
sequences that contain all of the various sequences (with the exception of 
the promoter sequence), both coding (such as sequences coding for the % 
proteins whose expression and presentation is desired) and non-coding (such 
as control sequences and regulatory regions, i.e. sequences that are 
necessary or desirable for the transcription and translation of a coding 
sequence to which they are operably linked or fused when they are 
compatible with the host into which they are placed) which, when operably 
joined (by linking, fusing or otherwise) to a promoter and placed into a 
compatible host, permit the sequence to be operational and express and 
present the protein(s) coded for by the coding sequence(s) thereof. 

As used herein, the terms "presented", "presentation", "present" and/or 
"presents" refer to the maimer in which the heterologous polypeptide (for 
example an antigenic peptide or protein) is positioned when provided as part 
of a hybrid particle (such as the fusion vaccine or fusion protein) in such a 
way as to elicit an immune response to the heterologous polypeptide. 

"Presentation systems" are DNA sequences which include both a coding 
sequence coding for a polypeptide (such as a heterologous polypeptide) 
whose presentation is desired, and other appropriate sequences therefore 
which permit such presentation when the DNA sequences are compatible 
with the host into which they are placed. 

"Expressions systems" are DNA sequences which include both a coding 
sequence coding for polypeptide(s) whose expression is desired and 
appropriate control sequences therefor which pennits such expression when 
the DNA sequences are compatible with the host into which they are placed. 
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As is generally understood, "control sequences" refers to DNA segments 
which are required for, or which regulate, expression of the coding 
sequence with which they are operably joined. 

We have found that it is the amino-terminal portion of a SLP which is 
5 sufficient for S-layer formation. We have found that more than the first 
19.56% by number of the amino acid residues, as measured from the 
amino-terminus of a mature SLP, are required for a S-layer to form. By a 
"mature SLP" we mean a SLP without signal peptide residues. More than 
the N-terminal 239 amino acid residues of the mature SLP shown in Figure 

10 6 are thus required. The first 41 .41 % by number of the amino acid 

residues of a mature SLP are sufficient, for example the first 506 amino 
acid residues of the mature SLP shown in Figure 6. 

More than the first 20% amino acid residues of an active SLP may be 
present. The SLP portion of a fusion polypeptide may therefore consist of 

15 the first N-terminal 28% or more or 35% or more, for example the most N- 
terminal 41 % or more or 50% or more or 60% or more or 80% or more, 
or even all, amino acid residues of a mature SLP such as the mature SLP 
shown in Figure 6. From the first N-terminal 28% to all, for example from 
the first N-terminal 35% or 41 % or 50% or 60% or 80% to all, of the 

20 amino acid residues of a mature SLP can be present. The first N-terminal 
400 or more, 600 or more, 800 or more or 1000 or more amino acid 
residues of a mature SLP may be present. A convenient restriction site in 
the DNA coding sequence of a SLP will typically determine the C-terminus 
of the SLP portion of a fusion polypeptide. 

25 The SLP portion of a fusion polypeptide is typically homologous with 

respect to the host cell in which the fusion polypeptide is expressed. In 
other words, the portion of the SLP which is present in the fusion 
polypeptide should generally be from a SLP of the same species as the host 
in which the fusion polypeptide is expressed. Typically the portion of the 

30 SLP in the fusion polypeptide is from the native SLP of the host in which 
the fusion polypeptide is expressed. The fusion polypeptide may 
incorporate an appropriate portion of a SLP of a bacterium of the genus 
Bacillus , for example of the species B. brevis or B. sphaericus . 
The heterologous polypeptide may be a physiologically active 

35 polypeptide or a foreign epitope (an antigenic determinant, peptide 

immunogen or epi tope-bearing moiety, as shall be discussed at greater 
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length below). The carboxy terminus of the functional portion of a SLP 
may be fused directly to the amino terminus of the physiologically active 
polypeptide or the foreign epitope. The fusion polypeptide therefore may 
consist essentially of the functional portion of a SLP and, fused directly to 
the carboxy terminus thereof, a heterologous amino acid sequence. 
Alternatively, the amino terminus of the functional portion of a SLP may be 
fused directly to the carboxy terminus of the physiologically active 
polypeptide or the foreign epitope. The fusion polypeptide therefore may 
consist essentially of the functional portion of a SLP and, fused directly to 
the amino terminus thereof, a heterologous amino acid sequence. 

Alternatively, an intervening linker sequence may be present between 
the functional portion of the SLP and the heterologous polypeptide. The 
linker sequence may be from 1 to 20, for example, from 1 to 5 or from h to 
10 amino acid residues long. The linker sequence may be designed to 
incorporate a cleavage site recognized by cyanogen bromide or a cleavage 
enzyme. *t 

The heterologous polypeptide is a polypeptide whose expression is not 
normally controlled by a SLP promoter i.e. is not a naturally occurring r 
SLP. The heterologous polypeptide can be a physiologically active 
polypeptide such as an enzyme. The polypeptide may be a polypeptide drug 
or a cytokine. Specific polypeptides which may be mentioned are or- 
amylase, tissue plasminogen activator, luteinizing hormone releasing 
hormone, a growth hormone such as human growth hormone, insulin, 
erythropoietin, an interferon such as a-interferon, and calcitonin. 

Alternatively, the heterologous polypeptide may comprise a foreign 
epitope or polypeptide immunogen. The polypeptide immunogen therefore 
typically comprises an antigenic determinant of a pathogenic organism. The 
immunogen can be an antigen of a pathogen. The pathogen may be a virus, 
bacterium, fungus, yeast or parasite. The foreign epitope may be an 
epitope capable of inducing neutralizing or non-neutralizing antibody or of 
inducing a cellular immune response. 

The immunogen or epitope may be derived from a virus such as a 
human immunodeficiency virus (HIV) such as HIV-1 or HIV-2; a hepatitis 
virus such as hepatitis A, B or C; a poliovirus such as poliovirus type 1, 2 
or 3; influenza virus; rabies virus; or measles virus. Examples of bacteria 
from which an immunogen or epitope may be derived include B. pertussin 



r 
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C, tetani . V. cholera . N meningitides . N. gonorrhoea, C. trachomatis and 
E. coll . The immunogen may therefore be the P69 antigen of B. pertussis , 
pertussis toxin or a subunit thereof, tetanus toxin fragment C, E^coU heat 
labile toxin B subunit (LT-B) or an E. coli K88 antigen, or an antigenic 
5 portion thereof. An immunogen derived from a parasite may be an 
immunogen derived from P. falciparum , a causative agent of malaria. 

As used herein, the terms "antigenic peptide", "antigenic determinant", 
"peptide immunogen", "polypeptide immunogen", "epitope" and "epitope- 
bearing moiety" all refer to substances that contain a specific determinant 
10 which induces an immune response (such as the production of antibodies or 
the elicitation of T-cell mediated response). The substance may itself be a 
hapten (i.e. a simple moiety which, when rendered immunogenic, behaves 
as an antigen) or it may be a more complex moiety, only portions of which 
are responsible for immuno specificity with regard to the antibodies 

15 obtained. 

As used herein, the terms "immunogenic response" and "immune 
response" refer to the biological responses, such as the raising of antibodies 
or the elicitation of T-cell or B-cell mediated responses, that are elicited in 
an organism (such as a mammal) by the presence of an antigen or 

20 immunogen. 

The present invention also provides recombinant DNA vectors 
comprising a recombinant DNA molecule according to the present invention 
and further provides a host cell transformed with such a recombinant DNA 
vector. The vector is typically an expression vector. The fusion 

25 polypeptide can thereby be expressed in a suitable host cell transformed 
with such an expression vector. A S-layer composed of the fusion 
polypeptide that is expressed can thereby be assembled on the surface of the 
host cell. 

An expression vector can include any suitable origin of replication 
30 which will enable the vector to replicate in a bacterium. A ribosome 
binding site is provided. The ribosome binding site is suitably located 
between the promoter and the DNA sequence encoding the heterologous 
polypeptide. If desired, a selectable marker gene such as an antibiotic 
resistance gene can be provided in the vector. The vector is generally a 
35 plasmid. 

The vector is normally provided with a transcriptional termination 
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sequence. The coding sequences of the recombinant DNA molecules and 
vectors of the invention are provided with translational start and stop 
codons. Vectors may be constructed by assembling all appropriate elements 
using techniques known in the art (Sambrook et al, 1989). 
5 According to the invention, a host cell provided with a S-layer 

comprising a fusion polypeptide is prepared by a process which comprises : 

(i) providing a suitable host cell incorporating a recombinant DNA 
molecule which comprises a promoter operably linked to a coding 
sequence which encodes a signal peptide and a fusion polypeptide, the 

10 signal peptide being capable of directing the said fusion polypeptide to 

be presented on the surface of the said host cell and the fusion 
polypeptide consisting essentially of a heterologous polypeptide fused to 
either the carboxy terminus or the amino terminus of at least sufficient 
of portion of a S-layer protein for a S-layer composed thereof to 

15 assemble on the surface of the said host cell; and 

(ii) culturing the said host cell so that the said fusion polypeptide is 
expressed and a S-layer comprising the fusion polypeptide is formed on 
the surface of the said host cell, the heterologous polypeptide thereby 
being presented on the outer surface of the said host cell. 

20 In a preferred variant of the invention, a host cell provided with a S- 

layer comprising a fusion polypeptide is prepared by a process which" 
comprises : 

(i) providing a suitable host cell incorporating a recombinant DNA 
molecule which comprises a promoter operably linked to a coding 

25 sequence which encodes a signal peptide and a fusion polypeptide, the 

signal peptide being capable of directing the said fusion polypeptide to 
be presented on the surface of the said host cell and the fusion 
polypeptide consisting essentially of a heterologous polypeptide fused to 
the carboxy terminus of at least sufficient of the amino terminal portion 

30 of a S-layer protein for a S-layer composed thereof to assemble on the 

surface of the said host cell; and 

(ii) culturing the said host cell so that the said fusion polypeptide is 
expressed and a S-layer comprising the fusion polypeptide is formed on 
the surface of the said host cell, the heterologous polypeptide thereby 

35 being presented on the outer surface of the said host cell. 

Preferably the host cell is one which does not secrete extracellular 
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proteases. The host cell is generally a bacterium, typically a bacterium 
which naturally produces a S-layer protein, i.e. a bacterium which in its 
native state has a S-layer on its surface. Depending upon the intended use, 
the bacterium may be a gram-positive or gram-negative bacterium. Host 
5 bacteria include bacteria of the genera Cocci and Bacilli may be 

transformed, for example Sta phylococcus , Streptococcus, Corynehacterium , 
T^ctobacillus . Bacillus . Clostridium and Listeria . Preferably, host bacteria 
include bacteria of the genera Bacilli . Useful bacteria in which the present 
invention may be applied are therefore Bacillus sphaericus and B. brevis . 

10 Rar.iHns s phaericus is a bacterium of the genus Bacillus in which a 

substantial quantity of the SLP's produced thereby are bound-up in the S- 
layer of the cell wall thereof and are not secreted extracellularly. As such, 
unlike B. brevis and B. subtilis . we have found that B. sphaericus possesses 
what is potentially an efficient SLP presentation system. 

15 The structure and properties of B. s phaericus have been characterized 

(see, for example Lewis et al (1987); Howard et al (1973) and Lepault et al 
(1986)). 

B. sphaericus (like the other Bacilli l has a high level of growth throughout 
its growth cycle, thereby increasing the quantities of the fusion polypeptide 

20 that can be expressed and presented thereby. 

A preferred strain of B. sphaericus is B. sphaericus 
P-l. B. sphaericus P-l has been deposited under the Budapest Treaty of the 
Belgian Coordinated Collections of Microorganisms (BCCM), LMG Culture 
Collection, Universiteit Gent, Lab. voor Microbiologic, K.L. 

25 Ledeganckstraat 35, B-9000 Gent, Belgium. The deposit was made on 13th 
May 1993 and was given accession number LMG P-13855. B. sphaericus 
P-l offers the further advantage of not producing detectable levels of 
extracellular proteases which can cause damage to fusing polypeptides 
produced according to the invention. 

30 The signal peptide is typically a signal peptide for a SLP, for example 

for a SLP of a bacterium of the genus Bacillus . It may be a signal peptide 
for a SLP of B. brevis . B. sphaericus or B. subtilis for example B, 
s phaericus P-l . Preferably the signal peptide is the signal peptide for the 
SLP of which an appropriate portion is incorporated in the fusion 

35 polypeptide. A signal peptide which is homologous, i.e. which is derived 
from the same species of cell, with respect to the host cell in which 
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expression of the fusion polypeptide is to occur can be employed. 
Preferably the native signal peptide for the native SLP of the host cell in 
which expression is to occur is provided. 

A useful process for preparing a host cell provided with a S-layer 
comprising a fusion polypeptide comprises: 

(a) providing an intermediate vector in which the coding sequence for an 
internal portion of the native SLP of the said host cell has 
translationally fused to the 3 '-end thereof the coding sequence for the 
heterologous polypeptide and in which the said coding sequences are 
provided upstream of a promotorless selectable marker gene such that 
they form a translational or transcriptional fusion therewith; 

(b) transforming the said host cell with the intermediate vector;' 

(c) selecting a transformed host cell which has a S-layer comprising the 
said fusion polypeptide. ; 
This process relies upon the occurrence of a single homologous 

recombination as a result of the introduction of the intermediate vector intd 
the host cell. The intermediate vector is typically a plasmid. An internal 
portion of the native SLP lacks the amino-terminal and carboxy-terminal 
amino acid residues of the native SLP. Up to the first 50, up to the first 
100, up to the first 200, up to the first 300, up to the first 400 or up to the 
first 500 of the amino-terminal amino acid residues may be missing. 
Independently up to the first 50, up to the first 100, up to the first 200, up 
to the first 300, up to the first 400 or up to the first 500 carboxy-terminal 
amino acid residues may be missing. 

The coding sequence for an internal portion of the native SLP therefore 
corresponds to the native SLP gene lacking its 5'- and 3'-ends. This coding 
sequence can be fused directly or via a sequence encoding a linker to the 5'- 
end of the coding sequence for the heterologous peptide. Suitable linkers 
are described above. The promoterless selectable marker gene may be the 
neomycin phosphotransferase H (npjn) gene which confers resistance to the 
antibiotics neomycin and kanamycin. The intermediate vector typically also 
comprises an origin of replication and a second selectable marker gene, for 
example an antibiotic resistance gene such as an erythromycin resistance 
gene. 

In one preferred embodiment, therefore, a host cell having a S-layer 
comprising a fusion polypeptide can be prepared by the following 
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procedure: 

1 . An appropriate intermediate vector is constructed that has following 
characteristics : 

- the cloned part of the SLP in the vector has to be internal to the SLP 
5 gene, i.e. contain no borders of the gene; 

- the cloned part of the SLP gene is translationally fused to the sequence 
encoding the heterologous peptide of interest; 

- both are cloned upstream of a promotorless first selectable marker gene 
(e.g. the nptn gene) so that they make a translational or transcriptional 

10 fusion; 

- optionally a replicon (such as that of pIL253 for R. sphaencus ) and/or a 
second selectable marker gene (such as the erythromycin (Em) resistance 
gene). 

2. The intermediate vector is introduced into an appropriate host cell such 
15 as K s phaericus P-l , for example via electroporation. 

3. Transformants are selected by means of the second selectable marker, 
for example Em resistant transformants selected. This can enable the 
structure of the intermediate vector to be verified. 

4. The selected transformant(s) are grown, for example overnight in LB 

20 medium containing 10 ng/m\ Em. 

5. The transformants thus grown are plated out. For example a bacterial 

suspension can be plated out directly on LB+ agar containing 5-10 
M g/ml neomycin (Nm) when the first selectable marker gene is the nptn 
gene or dilute starter culture in LB liquid medium containing the same 
25 amount of Nm, again when the first selectable marker gene is the npJJI 

gene. 

6. Colonies are selected by means of the first selectable marker, for 
example single Nm resistant colonies. 

7. Occurrence of a single homologous recombination is verified, for 
30 example by Southern analysis. 

8. Formation of the recombinant fusion polypeptide is verified, for 
example by sodium dodecyl sulphate-polyacrylamide gel electrophoresis 
(SDS-PAGE). 

Alternatively, a host cell provided with a S-layer comprising a fusion 
35 polypeptide can be produced by a process comprising: 

(a) fusing to a promoter a SLP coding sequence coding for the signal 



PCT/EP95/00147 



- 15 - 

peptide and at least sufficient of the amino-terminal portion of a SLP 
for a S-layer composed thereof to assemble on the surface of the host 
cell, and fusing a peptide coding sequence coding for the heterologous 
polypeptide to the 3 '-end of the SLP coding sequence, whereby a 
recombinant DNA molecule for the expression and presentation of the 
fusion polypeptide is prepared; 

(b) inserting the recombinant DNA molecule into a suitable vector, 
whereby a recombinant DNA vector is prepared; 

(c) transforming a suitable host ceU with the recombinant DNA vector, 
whereby a transformed host cell having the recombinant DNA molecule 
is provided; 

(d) culturing the transformed host cell, whereby the fusion polypeptide is 
expressed and a S-layer comprising the fusion polypeptide is assembled 
on the host cell wall. 

The transformed host cell is thus cultured in an appropriate culture 
medium. As a result of expression and presentation of the fusion £i 
polypeptide on the outer surface of the host cell, the heterologous 
polypeptide is thus presented so that an immunogenic response can be 
stimulated thereto when the host cell is administered to a human or animal 
host. The S-layer will also comprise the host cell's native SLP unless steps 
are taken to disable production of that SLP. 

DNA sequences consisting essentially of the appropriate coding 
sequences may be produced by ligation. The DNA sequences may be 
isolated and/or purified for use in the invention. Expression vectors can 
thus be prepared which incorporate a promoter operably linked to one of 
these DNA sequences. These vectors are capable of expressing the fusion 
polypeptide when provided in a suitable host. The vectors are generally 
plasmids. 

The coding sequences are located between translation start and stop 
signals. A ribosome binding site, an origin of replication and, optionally, a 
selectable marker gene such as an antibiotic resistance gene are typically 
present. In addition to the promoter, other appropriate transcriptional 
control elements are provided, in particular a transcriptional termination 
site. The promoter may be the natural promoter for a SLP protein such as 
a promoter for a Bacillus SLP, for example for a SLP from B.sphaericus or 
B.brevis. Typically the promoter is the native promoter for the SLP at least 
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the functional portion of which is present in a fusion polypeptide according 
to the invention. The coding sequences are provided in the correct frame 
such as to enable expression of the fusion polypeptide to occur in a host 
compatible with the vector. 

5 Transformation of a host cell may be achieved by conventional 

methodologies. We have found, however, that such methodologies do not 
work in the case of B. snhaericus P-l. We have devised a new technique 
for transforming B. sphaericus P-l. Accordingly, the present invention 
provides a process of transforming B. sphaericus P-l cells with DNA, 

10 which process comprises harvesting B. sphaericus P-l cells at the late 
stationary growth phase, mixing the harvested cells with the DNA and 
effecting electroporation to cause entry of the DNA into the said cells. 

Electroporation at the late stationary phase may be effected at from 8 to 
16 kV/cm, 150 to 250 0 and 20 to 40 j*F. Preferred conditions are 

15 12kV/cm, 200 fl and 25 fiF. Electroporation is generally carried out in 
electrocurvettes, for example 0. 1 cm- or 0.2 cm-gapped electrocurvettes. 

The transformed host cells are cultured under such conditions that 
expression of the fusion polypeptide occurs. The invention consequently 
additionally provides a host cell transformed with a recombinant DNA 

20 molecule, typically a vector, according to the invention. 

The host cell can be transformed so that none of the native SLP is still 
produced or so that the native SLP is produced in addition to the fusion 
polypeptide according to the invention. The invention therefore further 
provides a host cell which is able to express a fusion polypeptide according 

25 to the invention in addition to or instead of he SLP native to the said host 
cell. 

A host cell can therefore be engineered which presents a foreign epitope 
on its surface as a part of a composite S-layer. The S-layer incorporates the 
fusion polypeptide. The fusion polypeptide (for example, presenting a 

30 foreign epitope) and any native SLP produced by the host cell assemble into 
a S-layer. We have surprisingly found that fusion of a foreign amino acid 
sequence to at least a functional portion of a S-layer protein does not 
prevent the proper folding of the foreign sequence. The foreign sequence is 
thus presented on the surface of host cell and can be recognised by the 

35 immune system of a host, human or animal. 

The foreign sequence can also be presented on the surface of sacculi. 
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Sacculi, sometimes termed native sacculi or ghosts, are devoid of cytosolic 
and membrane proteins. They consist mainly of the peptido-glycan outer 
layer of bacterial cells surrounded by the S-layer. They can be derived 
from host cells according to the invention by simple procedures (Sara and 
Sleytr, 1987). For example, host cells may be sonicated, a detergent such 
as Triton X-100 added and the mixture incubated. After washing, the 
treated cells can then be incubated with DNAse and RNAse. The resulting 
sacculi are washed again. 

The host cell or sacculi derived therefrom can therefore be used as a 
vaccine. The host cell may be a non-pathogenic bacterium. It may be a 
bacterium which is naturally non-pathogenic or it may be an attenuated 
bacterium for this purpose, i.e. an attenuated form of a pathogenic 
bacterium. An attenuated bacterium typically contains one or more 
rationally directed mutations that prevent extensive spreading of the 
bacterium within the host to which the bacterium is administered. The 
bacterium can however still establish a limited infection leading to the 
stimulation of a natural immune response (Charles and Dougan, 1990). 

A pharmaceutical or veterinary composition may therefore be provided 
which comprises a host cell provided with a S-layer comprising a fusion 
polypeptide according to the invention and a pharmaceutically or 
veterinarily acceptable carrier or diluent. The composition may be 
formulated as a vaccine. The composition may be administered orally, 
intranasally or parenterally such as subcutaneously or intramuscularly. The 
dosage employed depends on a number of factors including the purpose of 
administration and the condition of the patient. When the host cell is a 
bacterium, typically however a dose of from 10 9 to 10 u bacteria is suitable 
for a human or animal for each route of administration. 

The composition may be in lyophilized form. The composition may be 
formulated in capsular form. The capsules may have an enteric coating for 
oral administration, comprising for example Eudragate "S", Eudragate "L", 
cellulose acetate, cellulose phthalate or hydroxypropylmethyl cellulose. 
These capsules may be used as such or alternatively, the lyophilised 
material may be reconstituted prior to administration, e.g. as a suspension. 
Reconstitution is advantageously effected in a buffer at a suitable pH to 
ensure the viability of the organisms. In order to protect the bacteria from 
gastric acidity, a sodium bicarbonate preparation is advantageously 
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administered before each administration of the composition. 

The presentation system of the invention has applicability beyond use as 
live bacterial vaccines. The heterologous polypeptides which are presented 
on the surface of host cells thus remain bound to the cells, so the 
5 presentation system may be used for screening proteins and antigens, and 
the system can also be used as a support for immobilising an enzyme, 
peptide and/or antigen (Georgiou et al, 1993; Smith et al 1993). 

Host cells according to the invention may therefore be used for display 
of antibodies and peptide libraries. A bacterial selection system 
10 complementary to phage display technology can thus be produced. The 
bacterial library can be separated by affinity chromatography. 

A host cell displaying on its surface a heterologous polypeptide of 
interest can also be used to raise antibody against that polypeptide. 
Polyclonal antibody can be raised by, for example, administering the host 
15 cell to a mammal. The mammal may be an experimental animal such as a 
rabbit, mouse or rat. Antisera can be obtained from the immunised 
mammal. 

Monoclonal antibodies can be obtained by adaptation of conventional 
procedures. A mammal is immunised with a host cell according to the 

20 invention, cells of lymphoid origin from the immunised mammal are fused 
with cells of an immortalizing cell line and thus - immortalized cells which 
produce antibody specific for the heterologous polypeptide of interest are 
selected. The selected cells are cultured to obtain quantities of the desired 
monoclonal antibody. 

25 In more detail, hybridoma cells producing monoclonal antibody may be 

prepared by fusing spleen cells from an immunised animal with a tumour 
cell. The mammal which is immunised may be a rat or mouse. The 
hybridomas may be grown in culture or injected intraperitoneally for 
formation of ascites fluid or into the blood stream of an allogenic host or 

30 immunocompromised host. Human antibody may be prepared by in vitro 
immunisation of human lymphocytes with respect to the peptide or a 
fragment thereof, followed by transformation of the lymphocytes with 
Epstein-Barr virus. 

The presentation system of the invention can further be employed as a 

35 whole-cell adsorbent. The expression of a heterologous polypeptide as part 
of the S-layer fusion polypeptide on the surface of host cells enables the 
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host cells to be employed as an affinity adsorbent. Host cells may also be 
used to present an enzyme as the heterologous polypeptide, thus acting as 
biocatalysts. 

As a consequence of cloning and sequencing the gene encoding the SLP 
5 of B. sphaericus P-l, in another aspect of the invention we have identified 
three promoters associated with the gene. One of these promoters is 
capable of directing a three-fold higher expression level than the wild-type 
promoter. A putative promoter previously indicated by Bowditch et a] 
(1989) was found incapable of directing expression. 

10 The present invention therefore additionally provides a first promoter 

having a -35 region of the sequence TTGAAT and a -10 region of the 
sequence TATATT. The critical parts of promoters are believed to be the - 
35 and -10 regions (Watson et al, 1987). According to the numbering 
scheme used, the DNA nucleotide encoding the beginning of the mRNA ,£ 

15 chain is 4-1. 

Typically there are 16 to 18 nucleotides between the -35 and -10 f 
regions. Preferably the intervening nucleotides are 
TTCGGAAAAGATAGTGT. A useful promoter has the sequence 
CTAAATTTATGTCCCAATGC TTGAATT TCGGAAAAGATAGTGT 

20 TATATT ATTGT. The -35 and -10 regions are underlined. 

A promoter having -35 and -10 regions of the sequences TTGAAT and 
TATATT, respectively, is the promoter having a transcription initiation site 
identified herein as PI (see Figure 10 of the accompanying drawings). This 
promoter is capable of directing expression at higher levels than the 

25 promoters having transcription initiation sites identified herein as P2 and P3 
(Figure 10) or than the entire wild type promoter sequence shown in Figure 
10 incorporating all of the three promoters. The PI promoter is in fact 
three-fold stronger but only when used alone, i.e. when separated from the 
P2 and/or P3 promoters. 

30 The invention also provides a second promoter having a -35 region of 

the sequence CTTGGTT and a -10 region of the sequence TATAAT. 
Typically there are 16 to 18 nucleotides between the two regions. 
Preferably the intervening nucleotides are ATTATTGAGAGTAAGG. A 
useful promoter has the sequence 

35 TCCAGAAAATG CTTG GTTATTATTGAGAGTAAGG TATAATA GGTA , 
the -35 and -10 regions being underlined. 
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The invention additionally provides a third promoter having a -35 
region of the sequence ATTACGGGA and a -10 region of the sequence 
TTTAGT. Typically there are 16 to 18 nucleotides between the two 
regions. Preferably the intervening nucleotides are 
5 GTCTTTAATTTTTGACAA. A useful promoter has the sequence 

AAA A TATTACGGGA GTCTTTAATTTTTGACAA JTTTACTAACCAT, 

the -35 and -10 regions being underlined. 

The three promoters may be tandemly arranged, for example in the 
order of the third promoter, the second promoter and the first promoter in 

10 the 5' to 3' direction. This is the order in which the three promoters occur 
in the wild-type promoter of B. snhaericus P-l shown in Figure 10. Useful 
DNA fragments incorporating the promoters according to the invention are 
the following DNA sequences shown in Figure 10, using the number system 
employed in that Figure: 

15 - nucleotides 52 to 353; 

- nucleotides 1 to 353; 

- nucleotides 1 to 406; and 

- nucleotides 1 to 455. 

The promoters can be used to direct expression of a 
20 heterologous protein in a host, for example a bacterial host such as a gram- 
negative or gram-positive bacterium. Suitable host cells are therefore as 
described above. The invention therefore provides: 

(a) an expression vector which comprises a promoter according to the 
invention and a downstream cloning site into which a DNA sequence 

25 encoding a heterologous protein may be cloned such that the promoter 

is operably linked to the said sequence; 

(b) an expression vector which comprises a promoter according to the 
invention operably linked to a DNA sequence encoding a heterologous 
protein; and 

30 (c) a DNA fragment comprising a promoter according to the invention 
operably linked to a DNA sequence encoding a heterologous protein. 
An expression vector (a) or (b) can include any suitable origin of 
replication which will enable the vector to replicate. A ribosome binding 
site is provided. The ribosome binding site is suitably located between the 
35 promoter and the cloning site or the DNA sequence encoding the 

heterologous protein as the case may be. If desired, a selectable marker 
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gene such as an antibiotic resistance gene can be provided in the vector. 
The vector is generally a plasmid. 

The cloning site of vector (a) may be provided at a translational start 
codon such as a Ncol site. Alternatively, no translational start codon may 
be provided in the vector. In that event, the foreign gene to be inserted into 
the cloning site would need to be provided with such a codon. Typically 
the gene inserted into the cloning site is provided with a translational stop 
codon. 

Both vectors (a) and (b) are normally provided with a transcriptional 
termination sequence. The DNA sequences of vector (b) and of the DNA 
fragment mentioned above are provided with translational start and stop 
codons. As in the case of vectors (a) and (b), the DNA fragment will 
typically incorporate a ribosome binding site downstream of the promoter. 
The DNA fragment may be single- or double* stranded, depending on its £ 
purpose. 

Vectors (a) and (b) may be constructed by assembling all appropriate 
elements using techniques known in the art (Maniatis et al, 1982). For 
example, vector (b) may be obtained by cloning a DNA sequence encoding 
a heterologous protein into vector (a) at the cloning site of that vector or by 
cloning a DNA fragment (c) into an expression vector provided with an 
origin of replication. The cloning site of vector (a) may be introduced by 
oligonucleotide-directed mutagenesis or polymerase chain reaction (PCR) - 
mediated site specific mutagenesis. The elements of vectors (a) and (b) are 
operably linked. The recombinant DNA fragment (c) may be constructed 
by ligating a foreign gene to a promoter sequence according to the 
invention. 

The DNA sequence encoding a heterologous protein may be provided 
immediately downstream of a DNA sequence encoding a signal peptide 
responsible for polypeptide secretion which in turn may be provided 
immediately downstream of the translational start codon. The signal 
peptide-encoding DNA sequence may encode any signal peptide capable of 
directing secretion of polypeptides from gram-positive bacterium. Typically 
the amino acid sequence of the signal peptide ends ValAlaSerAIa. 

The heterologous protein may be a heterologous peptide as described 
above. 

The following Examples illustrate the invention. In the accompanying 
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drawings: 

Figure 1 shows the characterization of pGVPl. 

A. Restriction map of pGVPl for Hindm (1), PstI (2), Sspl (3), that have 
double-occurring restriction sites. The single-occurring sites are 

5 indicated on the outside of the map. 

B. Identification of a circular single-stranded DNA molecule as a 
replication intermediate of pGVPl. 

Panel l.Ethidium bromide-stained 1% agarose gel. Lane A, non- 
digested total DNA of B. sphaericus P-l ; lane B, Hhal digested P-l 
10 total DNA; lane C, SI nuclease-treated P-l total DNA; lane D, P-l 

total DNA treated with T 4 DNA polymerase. 

Panel 2. Hybridization between 32 P-labeUed pGVPl and a non-denatured 
Southern blot of the gel in panel 1 on a nitrocellulose membrane. A 
specific hybridization signal, corresponding to single-stranded DNA, 
15 was observed only in non-digested (lane A) and T 4 DNA polymerase- 

treated P-l DNA (lane D), but not in Hhal (lane B) or SI nuclease- 
treated total P-l DNA (lane C). 

Panel 3. Hybridization between 32 P-labelled pGVPl and a Southern blot 
of a similar gel as in panel 1 , but denatured prior to transfer to a 
20 nitrocellulose membrane. In all lanes, a hybridization signal, 

corresponding to double-stranded DNA can be observed. In lanes A 
and D, additionally the signal corresponding to single- stranded DNA 
can be observed. 

Figure 2 shows the results of plasmid analysis of Em-resistant B, 
25 sphaericus P-l transformed by pEL253. Central panel. Hindm-digested 
plasmid preparations of Em R B. sphaericus P-l, transformed by pDL253 
(lanes 1-4), separated by agarose gel electrophoresis (ethidium bromide 
stained). Left panel. Autoradiogram of the hybridization between 32 P- 
labelled pIL253 and a Southern blot of the gel in the cental panel. In all 
30 transformants, fragments specific for the introduced pEL253 plasmid (3.9 kb 
and 0.9 kb) are revealed. Right panel. Autoradiogram of the hybridization 
between the same blot as in the left panel, and 32 P-labelled pGVPl. 
Specific fragments (2.3-0.5 kb) from the endogenous pGVPl are revealed. 
Figure 3 demonstrates the electrocompetence in 2L sphaericus P-l in 
35 late- stationary phase. B. sphaericus P-l cells were incubated at 37° C in 

100 ml LB broth on a gyratory shaker for 48 hr. Every 6 hr a sample was 
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withdrawn from the culture and colony-forming units/ml were determined. 
Cells were pelleted by centrifugation, washed and resuspended in 1 ml of 
distilled H 2 0. After addition of pHL253 DNA, cells were electroporated in 
0.2 cm gapped electrocuvettes (E 0 12 kV/cm, R= 200 0), diluted in 900 ^1 
LB, incubated for 1 hr at 37oC and plated on LB plates with erythromycin. 

Figure 4 is a schematic representation of the high-copy number (pSL40) 
(A) and low-copy number (pSL84) (B) bifunctional vectors for E. coli and 
B.sphaericus spp. Restriction sites are indicated with their relative position 
on the physical map. Abbreviations used: bla: ^-lactamase; MLS: 
resistance to the macrolide-lincosamide-streptogramin B group of antibiotics; 
Orf E to G: open reading frames involved in replication in Gram-positive 
hosts. pSL40 was constructed by ligating the 2.6-kb EcoRI/Xbal fragment 
of pLK68 in pIL253 (EcoRI/Xbal- linearized). After restriction and filling- 
in at the EcoRI site of the resulting plasmid, the small multicloning site was 
exchanged for the polylinker of pJB66 by substituting the respective v 
Xbal/BglH fragments. pSL84 was constructed by substituting the 1.8j-kb 
Pstl/Sall fragment of pSL40 for the 1.3-kb Pstl/Sall fragment of 
pACYC177 (containing the low copy number-origin of replication- for E. 
coil). In the resulting plasmid the 2.3-kb Nsil/Xbal fragment was replaced 
by the corresponding Nsil/Xbal fragment of pEL252 containing the low-copy 
number origin of replication for Bacillus spp. 

Figure 5 is a restriction map of the genomic region containing the gene 
encoding the S-layer protein of B. sphaericns P-l. The black bar represents 
the signal peptide-encoding sequence; the hatched bar shows the mature 
part of SLP. The inserts of the four overlapping subclones used for 
sequence analysis are depicted below the restriction map by open bars. The 
arrow indicates the direction of transcription. 

Figure 6 shows the DNA sequence and deduced amino acid sequence of 
the s]p_ gene of B.sphaericus P-l . The putative ribosome-binding site 
preceding the SLP ORF is double underlined. The shaded residues 
represent the signal peptide. The mature SLP thus commences with amino 
acid residue 31. Potential N-linked glycosylation sites are underlined. The 
stem of the Rho-independent transcription termination signal after the 
translation stop codon is indicated by arrows. The NH 2 -terminal amino acid 
sequence determined by automated microsequence analysis of the purified 
mature SLP is indicated by a dotted line. 
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Figure 7 shows the hydropathic profile of the R^phaericus P-l S-layer 
protein by the computerized method of Kyte and Doolittle (1982). 
Horizontal bars represent potential transmembrane helices, as predicted by 
the method of Rao and Argos (1986). 
5 Figure 8 shows the sequence of the NH 2 -terminal portion of the SLP of 

B s phaericus P-l and the larvicidal strain 2362. The signal peptide 
sequence of both proteins is boxed. Adjustments (horizontal bars) were 
introduced for optimal alignment. 
Figure 9 : 

10 A. Northern blot analysis of the ilp-encoded mRNA in B^haencys P-l . 
Total cellular RNA was isolated from different growth phases (see 
text). The internal 1 .81 kb Hpal fragment of the sip gene was used for 
the generation of a "P-labeUed probe. Migration pattern of molecular 
mass markers as indicated. 

15 B. Primer extension analysis of the transcriptional initiation sites of the 
sip-encoded transcripts. Two different primers were used (see Figure 
10 and text for more details). 

Figure 10 shows the DNA sequence of the promoter region controlling 
sip expression. The position of the 5' ends of the transcripts, as determined 
20 by primer extension analysis, are indicated (black inverted triangles). The 
putative ribosome-binding site, preceding the sip ORF (shaded sequence) is 
indicated by dots. Primers used for primer extension assays were 
complementary to the overlined sequences. The exact end points of the 
different deletion mutants (pSL151 to pSL159) are shown by an arrow. 
25 Potential -10 and -35 boxes preceding the transcription initiation sites are 
indicated. The putative -10 and -35 regions as reported by Bowditch et_aL 
(1989) are marked by an asterisk. 

Figure 1 1 shows how jS-glucuronidase activity is directed by the 
different sip promoter deletion mutants in B. sphaericus P-l (hatched bars). 
30 In pSL87 the uidA gene is under control of the 1380 promoter. 

Figure 12 shows the effect of Ca 2+ cations on the sip promoter-directed 
/^-glucuronidase activity in different deletion mutants. Black bars represent 
R s phaericus P-l cells grown in IJB medium. Hatched bars indicate cells 
grown upon addition of 7 mM CaCl 2 . Activity was measured 4 hours after 
35 addition of CaCl 2 to the culture. 

Figure 13 shows the general outline of the strategy for disruptive single 
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homologous recombination. AB, R : antibiotic resistance marker 1 
(Em R );AB2 R : promoterless antibiotic resistance gene 2 (oetll); ori: origin of 
replication; wavy line: RNA transcript; P: promoter. 
Figure 14: 

5 A. Restriction map of the Afunctional plasmid pSL64 used as based for the 
construction of different intermediate vectors. Restriction sites are 
indicated with their relative position on the physical map. 
Abbreviations used: bla: /S-lactamase; MLS: resistance to the 
macrotide-lincosamide-streptogramin B group of antibiotics; OrfE to 
10 G: open reading frames involved in replication in Gram-positive hosts 

(Swinfieldet aL, 1990). 
B. Nucleotide sequences upstream from the nptll-coding region (boxed) of 
pSL64 andpSLlOl. 
Figure 15 : 

15 A. Schematic representation of carboxy-terminal truncated SLPs obtained 
by single homologous recombination of the different intermediate 
vectors. Central block represents the restriction map of the 
chromosomal region containing the sip gene. Hi: Hindm; Hp: Hpal; 
Bg: BglH; Pv: PvuII; Xb: Xbal. Black arrows represent SLPs in the 

20 wild-type and recombinant P-l strains as indicated on the left. 

Calculated molecular masses of the SLPs are indicated on the right. 
Striped bars indicate the used subclones of sip gene. White bars 
indicate internal sip fragments, cloned in pSL64 in the different 
intermediate vectors. 
25 B. SDS-PAGE of proteins from wide-type P-l (lanes A), recombinant 
strains P-l::pSL66 (lanes B), P-l::pSL68 (lanes C) and P-l::pSL69 
(lanes D). lane M: high-molecular mass markers (Bio-Rad). Proteins 
are either TCA-precipitated from the supernatant of the cultures or are 
obtained by sonication and centrifugation. The insoluble fraction is 
30 indicated by debris, whereas the soluble fraction is indicated as 

sonicate. 

Figure 16 is an autoradiogram of the hybridization between 32 P-labelled 
pSL20 and BglH-digested total DNA of P-l (lane A), P-l::pSL69 (lane B), 
and P-l:: pSL102 (lane C). In P-l:: pSL69 and P-l::pSL102, the 1600 BP 
35 BglH fragment, hybridizing with P-l total DNA has disappeared, whereas 
two predicted fragments of 2600 and 6500 bp appeared. 
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Figure 17 is a schematic representation of the peptides translation^ 
fused to carboxy-terminally truncated SLPs in the strains P-l::pSL102, P- 
1- P SL113 and P-l::pSLUl. Central block represents the restriction map 
of the chromosomal region containing the sip gene, Hi: Hindni; Hp: Hpal; 
5 Bg- Bgffl- Pv: PvuH; Xb: Xbal. Arrows under the restriction map 
represent 'recombinant SLPs after integration of intermediate vectors 
indicated above the restriction map. Black fragments represent SLP 
portion, striped bars indicate the SI subunit of pertussis toxin, and white 
bars represent NPTTL 

10 Figure 18 : 

A SDS-PAGE of total protein extract from P- 1 : :pSLl 1 3 (lane 1 ) , P- 
l::pSL102(lane2),andP-l:: P SL69(lane3). Respective SLPs are 
indicated by a (130 kDa), b (102 kDa) and c (74 kDa). 
B Immunodetection on a Western blot of the gel in panel A, using ant>- 
15 NPTQ antibodies. Two recombinant SLPs (indicated a and b) are 

revealed. 

C. SDS-PAGE of total protein extract from P-l::pSLlll (lanes 1-3), 

P-l--pSL102 (lane 4) and P-l::pSL69 (lane 5). 
D Immunodetection on a Western blot of the gel in panel C, using anti-PT 
" antibodies. Two recombinant SLPs (of 90 and 120 kDa) are revealed 
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25 



(d* and d, respectively). 

Figure 19 is an autoradiogram of in gel kanamycin phosphorylation 
assay on protein extracts separated by non-denaturing polyacrylamide gel 
electrophoresis extracted from P-l (panel A) P-1::PSL69 (panel B), and 
P-l::pSL102 (panel C). Significant phosphorylating activity in the high- 
molecular mass region can only be observed in P-l::pSL102. 

Figure 20 shows immunogold labelling on intact bacteria using anti- 
NFTn antibodies. Panel A, P-l::pSL69, panel B, P-S:: P SL102; panel C, 
P-2::pSLU3. Significant accumulation of gold-label can only be observed 
in P-a :pSL102 and to a lesser extent in P-l::pSL113. 

Figure 21 shows the detection of FT subunit SI and NPTH in native 
saccuU prepared from P-l::PSL102 and P-1:: P SL111. CE: cellular 

extracts; NS: native saccuii. 
Example 1 
35 1 Materials a nd Methods 

p.,^,, c, ra in< ,nd nlasmids. In Table I, the bacterial strains and 
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plasmids used in this study are listed. R sphaericus strains were grown in 
Luria-Bertani (LB) broth (Miller, 1972), supplemented with 0.7% agar for 
solid media. Selective antibiotic concentrations for B. sphaericus were: 
10 fig/ml erythromycin (Em); 10 /xg/ml nalidixic acid (Na). For R coli . 
5 200 ng/ml of triacillin was used. 

Table L 

Bacterial strains and plasmids used in this study 
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Characteristics . 


References 


E. coli 






DH5a 


F, 08OdlacZAMl5, A(lacZYA-argF)y, 69 , 
recAl, endAl, hsdR17(r t %m k + ), supE44 


Hanahan 
riQR3 , > 


MC1061 


hsdR, hsdM, hsdS, araD139, 
A(ara-leu) 7697 , Mac x74 , galU, galK, 
rpsL 


Casadaban 
and Cohen 
(1980) 


R snhaericus 




P-1 


Nalidixic acid resistant 


Lewis et al. 
(1987) 


1593 




BGSC 


10208 




ATCC 


T jirtoroccus lactis 




MG1363 




Gasson and 
Davies (1980) 


Pin cmifte 






pGVPl 


natnrpi Jcnlatp. in R sphaericus P-1 


This study 


pGVP2 


cointegrate of Bamffl-linearized pUC9 
and Bg III- linearized pGVPl 


This study 


pUC9 


Ap R 


Vieira and 

Messing 

(1982) 


pPGV5 


bifunctional, Ap R , Nm R 


This study 


pJB66 


Ap R 


Botterman 
and Zabeau 
(1986) 


pSL40 


cointegrate of pJB66 and pEL253, Ap R , 
MLS R 


This study 


pSL84 


cointegrate of pACYC177 and pIL252, 
Ap R , MLS R 


This study 


p AM/31 


MLS R , autotransmissible, natural 
isolate 


Clewell et. 
al (1974) 


pIL252 


MLS R , low-copy number vector derived 
from pAM/31 


Simon and 

Chopin 

(1988) 


pIL253 


MLS R , high-copy number vector derived 
from p AM/31 


Simon and 

Chopin 

(1988) 


pACYC177 Ap R , Km R , low-copy number vector 


Chang and 
Cohen (1978) 
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Transformation and plasmids . Competent E. coli strains were prepared 
and transformed according to Kushner (1978). Transformation of R 
sphaericus P-l intact cells was achieved by electroporation using the 
protocol developed as described in the Results section below. B. sphaericus 
5 P-l cells, grown in LB broth for 42 hr at 37°C on a gyratory shaker, were 
harvested by centrifugation (9000 g), washed with ice-cold distilled H 2 0, 
resuspended in 1/10 volume of a 10% glycerol solution in distilled H 2 0,' 
aliquoted in 100 pel samples, and stored at -70°C. 

For transformation, samples were quickly thawed, mixed with DNA, 
10 and transferred into 0. 1 cm gapped electrocuvettes. An electrical pulse (14 
kV/cm, 25 fiF) was delivered, using a GenePulser (Trade Mark) apparatus 
(Bio-Rad laboratories) with Pulse Controller extension set at 200 fi. After 
the electrical pulse was delivered, cells were diluted with 900 M l of LB 
broth and incubated at 37o C for 1 hr, prior to plating on solid LB medium, 
15 and supplemented with appropriate antibiotics. 

General recombinant DNA tgchnig ues. E. coli plasmid DNA was prepared 
according to Sambrook et al. (1989), whereas for plasmid and total DNA 
preparations ofB. sphaericus, cells were pretreated with lysozyme (100 
/xg/ml) at 37°C for 10 min. Restriction enzymes were purchased from New 
England Biolabs, Pharmacia (Uppsala, Sweden) or Bethesda Research 
Laboratories, and were used according to the manufacturers' 
recommendations. 

Elution of DNA restriction fragments was done using GeneClean H 
(Trade Mark) kit (BiolOl Inc., La JoUa, California, US). Filling-in of 
protruding single-stranded termini after restriction enzyme digestion (using 
Klenow or T 4 DNA polymerase) and ligations were done according to 
standard conditions (Sambrook ej ah, 1989). Southern transfer and 
hybridization were performed using Hybond N + membranes (Amersham) 
and QuickPrime (Trade Mark) labelling kit (Pharmacia) to prepare 32 P- ' 
labelled probes, except for blotting of single-stranded DNA, which was 
achieved using nitrocellulose membranes. 
2. Results 

Characterization of endogenous pjasmjds of B. snhaerims P-l Plasmid 
preparations, according to the alkaline lysis method, followed by 
equibbrium density gradient centrifugation, revealed the presence of a small 
plasmid (2.8 kb), designated pGVPl, in B. sphaericus P-l. By preliminary 
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restriction analysis of pGVPl, a unique restriction site for Bgffl was found. 
For further restriction enzyme analysis, a cointegrate plasmid (pGVP2) was 
constructed, by joining BgUI-linearized pGVPl and BamHI-linearized 
pUC9, allowing large-scale preparations from E^coli. Single- and double- 
5 occurring restriction enzyme sites (Bgffl, Aval, Ncol, PstI, HindTU, Sspl) 
were ordered by appropriate double digestions (Fig. 1A). No sites were 
founds for Kpnl, BamHI, EcoRI, Apal, Clal, EcoRV and Sphl. 

Hybridizations were performed between 32 P-labelled pGVPl and 
Southern transfers of non-denatured total DNA of B, .s phaericus P-l to 
10 nitrocellulose membranes (a frequently used method for detection of single- 
stranded replication intermediates in Gram-positive replicons; Gruss and 
Ehrlich, 1989). A specific hybridization signal was observed in undigested 
total DNA (Fig. IB, panel 2, lane A), corresponding to a single-stranded 
intermediate. The signal was not detected in DNA samples which were 
15 digested either with SI nuclease or with the single- stranded DNA-cleaving 
endonuclease Hhal, prior to gel separation. 

Treatment with T 4 polymerase, which degrades specific linear single- 
stranded DNA, did not decrease the signal, indicating that the pPGVl 
replication intermediate is circular. As it is of considerable interest to use 
20 B. sphaericus P-l as a host for transformation experiments several methods 
for plasmid curing that proved successful in Gram-positive bacteria 
[including novobiocin (Gonzales et al., 1981), rifampin (Johnston and 
Richmond, 1970), sodium dodecyl sulfate (Sonstein and Baldwin, 1972)] 
were tried out but did not result in the production of a plasmid-free strain. 
25 introduction of pAMfll into B sphaericus P-1 by intergeneric 

conjugation . To test whether the macrolide lincosamide steptogramin B 
(MLS) resistance determinant and the origin of replication of the auto- 
transmissible plasmid pAM/31 (26.5 kb) were functional in B. sphaericus P- 
1, this plasmid was introduced into P-l by conjugating it with Lactococcus 
30 lactis MG1363 [pAM01] (Gasson and Davies, 1980). This plasmid was 
chosen because it had previously been introduced successfully into B, 
s phaericus 1593 (Orzech and Burke, 1984). After overnight incubation of a 
mixture of both strains (ratio 1:1) on nitrocellulose filters placed on M17 + 
lactose medium at 37°C, bacteria were collected and several dilutions were 
35 plated on LB medium supplemented with Em. 

Em-resistant B. s phaericus P-l colonies were obtained at a frequency of 
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3 x 10" 6 (transconjugants/acceptor strain). After colony purification on LB 
medium supplemented with erythromycin and nalidixic acid, putative 
transconjugants were analyzed for the presence of pAM/?l by Southern 
hybridization using 32 P-labelled p AM/31 as a probe (data now shown). The 
5 plasmid was stable for several generations, even in the absence of selective 
pressure. These results prompted us to use pAM/Sl -derived cloning vectors 
(e.g. pDL253) for electrotransformation experiments in P-l. 

Electrotra nsformation . Initial experiments using pDL253 to transform 
B. sphaericus strains, following reported protocols for electrotransformation 
10 of several Bacilli (Takagi et ah, 1989; Bone and Ellar, 1989; Taylor and 
Burke, 1990) were unsuccessful. The common denominator in these 
protocols is the use of cells harvested in early or mid-log growth phase. 
However, using cells harvested from late-log solid-grown colonies, which 
were washed once with ice-cold distilled H 2 0, and standard electrical ' 
15 parameters for E. coli (12kV/cm, 200 0, 25 /xF in 0.2 cm gapped 

electrocurvettes), 10 2 transformants were obtained. Plasmid analysis ** 
revealed the presence of two plasmids that could be identified as the 
endogenous pGVPl and the introduced pEL253 by Southern hybridization 
(Fig. 2). 

20 To optimize the physiological conditions for electroporation of P-l , 

cells harvested at different time-points in a growing culture were washed 
once and resuspended in 1/10 volume distilled H 2 0, and electroporated (at 
. 12 kV/cm, 200 0, 25 /xF in 0.2-cm gapped electrocuvettes). Although 
growth of the culture stagnated after 8 hr, transformants were not obtained 

25 until 36 hr of incubation. The number of transformants reached a maximum 
at 42 hr incubation, and significantly decreased after 48 hr of incubation, 
presumably due to cell death (Fig. 3). This phenomenon demonstrates the 
need for a certain physiological state of the bacterial cells required for 
electrotransformation, or in other words electrocompetence. 

30 Electrocompetence has been inferred to explain saturation of transformation 
efficiencies at increasing DNA concentration (Chassy et al., 1988; Desomer 
et aL, 1990). 

Addition of different chemicals (used to increase transformation 
efficiency in protocols for different bacterial species) to the electroporation 
35 medium, such as polyethylene glycol (PEG) 1000 (15% w/v) and glycerol 
(10% w/v) improved the transformation efficiency significantly. (Table II). 
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Variation of the electrical parameters included transformation at higher 
voltages (12, 14 and 16 kV/cm in 0.1-cm gapped electrocuvettes) and use of 
different external resistances (200, 400 and 600 0). Maximum 
^formation efficiencies were obtained at 14 KV/cm. 25 ,F, 200 0 usmg 
5 0 1-cm gapped electrocuvettes (Table H below). 

Comb^Ln of both improved protocols, as described in the Matenals 
and Methods section above, routinely yielded 10 s transformants per M g 
DNA. Cells could be kept frozen at -7 0 o C without significant loss of 
eleetrocompetence. 

10 The high transformation efficiency obtained by this protocol prompted 

us to test whether plasmids with single-stranded replication intermediates 
(such as the pUBl 10-derived vector pPGVS) could be used as transforming 
DNA, and eventually yield a P-l strain, cured of pGVPl by mcompatibihty. 
pPGVS is a cointegrate via the EcoPJ site of P UC4 (Vieira and Messing, 
15 1982) and pPL703 (Mongkolsuk et al, 1983). Nm-resistant transformants 
" were obtained with low frequency (Table H below) and contained intact 
pPGVS in addition to the endogenous pGVPl (data not shown). 
" Application of the same protocol to B^phaenjyis 1593 and ATCC 
10208 yielded no transformants (Table U below). Indeed, a previously 
20 published protocol for ^transformation for B.^haericus 1593 used 
cells harvested in early-log growth-phase (Taylor and Burke, 1990). 

C^ttucrion of Afuncti onal vectors for E. roli and B. sphaencus . 
Bifunctional vectors that can replicate in both R^phaericus and E^cph 
have the advantage that cloning procedures and analysis can be done with 
25 well established methods in R_coji prior to introduction of the final 

construct into B snhaericus . Bifunctional plasmids were constructed with 
high- (pSU40) and low-copy number (pSL84) in both hosts. pSLAO contains 
the multilinker, ampicillin-resistance gene and the ColEl origin of 
replication of P JB66 (Botterman and Zabeau, 1980) as well as the MLS 
30 determinant and origin of replication of P IL253 (Fig. 4A). In pSL84 the 
high-copy number origin of replication of pJB66 is exchanged for the low- 
copy number origin of pACYC177, whereas the origin of pIL253 is 
replaced for that of P IL252, an ancestral plasmid of pIL253 with low-copy 
number (Fig 4B). Upon introduction in IL sphaencus P-l, both plasmids 
35 exhibited the expected copy number control. No significant difference in 
transformation efficiency was observed using P SL40 or P SL84 prepared 
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from either the E. coli MCI 061 or the B.sohaericus P-l hosts (Table U 
below). 

Table n 

Transfor mation efficiencies using different electrical 
conditions. R s phaericus strains and p lasmiris 



Strain 


Electropo ration 
medium 


E c 
kV/cm 


R 

o 


P-l 


H 2 0 


0 


200p 


Jr-1 


TT /""V 

H 2 0 


6 


200 


P-l 


H 2 0 


10 


200 


P-l 


H 2 0 


12 


200 


P-l 


H 2 0 


12* 


200 


P-l 


H 2 0 


14 1 


200 


P-l 


H 2 0 


16* 


200 


P-l 


H 2 0 


14* 


400 


P-l 


H 2 0 


14" 


600 


P-l 


30% PEG 1000 


12 


200 


P-l 


15 % PEG6000 


12 


200 


P-l 


15% PEG1000 


12 


200 


P-l 


10% glycerol 


12 


200 


P-l 


10% glycerol 


14* 


200 


P-l 


H 2 0 


12 


200 


1593 


H 2 0 


12 


200 


10208 


H 2 0 


12 


200 


P-l 


10% glycerol 


14* 


200 


P-l 


10% glycerol 


14* 


200 



Plasmid 



Transformation 
efficiency" 



DL253 

pEL253 

plL253 

pIL253 

pDL253 

pIL253 

pIL253 

pIL253 

pEL253 

pIL253 

pIL253 

pEL253 

pIL253 

pIL253 

pPGV5 b 

pIL253 

pIL253 

pSL40 

pSL40 b 



<10' 
2.0 x 

3.4 x 

8.5 x 

9.6 x 

6.7 x 

8.6 x 

5.3 x 

4.7 x 
<10' 
4.0 x 

2.4 x 

1.4 x 

6.5 x 
8.0 x 
<10' 
<10' 
2.0 x 

2.6 x 



10 1 
10 2 
10 2 
10 3 
10 3 
10 2 
10 3 
10 2 

10' 
10 2 
10 3 
10 s 
10' e 



10 3 
10 3 



* in 0.1-cm gapped electrocuvettes; b plasmid source was E. coli : c selected 
on LB with Nm; 6 number of transform ants selected on LB EM per fig of 
DNA. 
Example 2 

1. Materials and Methods 

Bacterial strains, plasmids and mftdia Bacterial strains used are 
B.sphaericus P-l (Lewis et al, 1987) and B.subtilis BR151 (Bacillus Genetic 
Stock Center, Ohio State University, Columbia). E. coli hosts were either 
DH5or (Han ah an, 1983) or MC1061 (Casadaban and Cohen, 1980). 
Cloning was performed in pUC18 (Yanisch-Perron et al, 1985). pSL40 
(Example 1) is a high copy number E. coli-Bacillus shuttle vector, 
essentially composed of pJB66 (Botterman and Zabeau, 1987) and pEL253 
(Simon and Chopin, 1988). The ^-glucuronidase (uidA) gene cassette was 
isolated from pGUSl (Peleman et aj, 1989) and introduced into pSL40 as a 
BamHI/SphI fragment, yielding pSL150. Bacteria were grown on LB 
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medium (Miller, 1972) solidified with 1.5% agar, whereas liquid cultures 
were grown in TB medium (Tartof and Hobbs, 1987). When required 
antibiotics were added: ampicillin (100 M g/ml) or erythromycin (10/xg/ml). 
All cultivations were performed at 37°C. 

5 dna techniques. Recombinant DNA techniques for JL_coh were 

performed according to standard conditions (Sambrook et al, 1989). 
Restriction and modifying enzymes were purchased from Pharmacia, New 
England Biolabs, Promega or Bethesda Research Laboratories and used 
according to their recommendations. DNA fragments were purified from 

10 agarose gel using the Gene Clean Kit (Bio-101 Inc.). DNA sequences from 
both strands were determined by the dideoxy-chain termination method 
(Sanger et aj,l977) using the T7 Sequencing Kit (Pharmacia). Sequence 
analysis was carried out with the Intelligenetics suite of program 
(Intelligenetics Inc.). Databases were screened by FASTDB software 

15 (Brutlag et al, 1990). Unidirectional deletions were generated by combined 
Exom/Sl nuclease activity, using the Double-stranded Nested Deletion Kit 
(Pharmacia). Oligonucleotides were synthesized on an ABI 394 DNA/RNA 
Synthesizer (Applied Biosystems Inc.). High voltage transformation of R 
coU DH5« with ligation mixtures was done with a Bio-Rad Gene Pulser 

20 (Trade Mark). Site-specific mutagenesis using polymerase chain reaction 
(PCR) was performed as described (Landt et al, 1990). 

r^f^runn and — ""P libraries. B. s phaericus P-l genomic 
DNA was prepared as described by Mielenz (1983) and digested to 
completion with the appropriate restriction enzyme. Libraries were 

25 constructed in digested and dephosphorylated pUC18, according to standard 
conditions (Sambrook et al, 1989). 3840 colonies were transferred to 
Hybond-N nylon membranes (Amersham International) and screened by 
colony hybridization under standard stringency conditions. "P-labelled 
probes were generated using the "Quick Prime (Trade Mark) Kit 

30 (Pharmacia). 

BNA analysis . Total RNA was extracted from B. sphaencus P-l by 
the hot-phenol method of Aiba et al (1981). Total RNA isolated at different 
growth phases was run on a formaldehyde containing agarose gel and 
transferred to Hybond-N nylon membranes (Pharmacia) and hybridized 

35 according to the manufacturer's recommendations. Single stranded 

oligonucleotides (Figure 10) were used as primers in a primer extension 
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assay. 50/xg of RNA was mixed with 50 ng primer and ethanol- 
precipitated. The pellet was resuspended in 20 jd 5 x hybridization buffer 
(2 M NaCl, 50 mM PIPES pH 6.4, 5 mM EDTA) and 80 fil deionized 
formamide and incubated for 15 minutes at 85°C. Primer annealing 
proceeded over-night at 37° C. Primed RNA was ethanol-precipitated. The 
extension reaction was carried out for 90 minutes at 42°C in a 40 /ri 
mixture containing 50 mM Tris.HCl (pH 8.2), 10 mM dithiothreitol, 6mM 
MgCl 2 , 25 ng/ml actinomycin D, 250 /xM dCTP, dGTP, dTTP, 150 fiM 
dATP, 60 fid [or- 35 S] dATP (Amersham International) and 40 units Reverse 
Transcriptase. Upon completion of the reaction 2jd pancreatic RNase (1 
mg/ml) was added and incubated for another 20 minutes. Extension 
products were purified by phenol/chloroform extraction and subsequent 
ethanol precipitation. The size of the extended products was deduced by ^ 
comparison to a corresponding sequence ladder, generated with the same 
primer. 

Protein micro-seque nce analysis . The surface-layer protein from B, 
sphaericus P-l was isolated from cell walls by urea extraction as described 
by Lewis et al (1987). Upon sodium dodecyl sulfate polyacrylamide gel 
electrophoresis (SDS-PAGE) proteins were electro-blotted and immobilized 
on a treated glass fibre plate (Bauw et al, 1987) for NH 2 -terminal amino 
acid sequence determination on an automated ABI 473A Protein Sequencer 
(Applied Biosystems Inc.). 

Enzyme assay. ^-glucuronidase activity was measured essentially as 
described (Jefferson §t al, 1986) using p-nitrophenyl-/?-D-glucuronide as 
substrate. Cells were resuspended in 1 ml reaction buffer, supplemented 
with 0.01 % SDS. Permeabilization of bacterial cells was achieved by 
addition of 25 fi\ chloroform and vortexing for 10 seconds. After 
termination of the reaction, cells were pelleted and the cleared supernatant 
was used for O.D. measurement. 
2. Results 

Identification and cloning of the sip pe.np. The surface-layer protein of 
B. sphaericus P-l was purified and subjected to automated microsequence 
analysis. 21 NH 2 -terminal amino acid residues could be deduced: NH 2 -Ala- 
Gln-Val-Asn-Asp-Tyr-Asn-Lys-Ile-Ser-Gly-Tyr-Ala-Lys-Glu-Ala-Val-Gln- 
Ala-Leu-Val. Based on residues 11 to 19 of this sequence, a specific 
ougodeoxynucleotide probe mixture was synthesized with the following 
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sequence : 5'-GCYTGIACIGCYTCYTIIGCITAICC-3' wherein Y is C or 
T and wherein I is inosine. Several unique bands were revealed on 
Southern blots of restricted genomic DNA when hybridized to the P- 
labelled oligonucleotide probe. 

A preliminary restriction map was established from which it was 
deduced that the 5.0-kb EcoRI fragment very probably contained the 
complete sip gene. An EcoPJ-generated library of B^phaericus P-l 
genomic DNA in pUC18 was screened by colony hybridization. Despite the 
use of different hybridization conditions no subclones containing the 5.0-kb 
fragment could be isolated, suggesting that cloning of the entire gene or a 
large part of it in E^coU is lethal to the host. Therefore cloning of smaller 
restriction fragments, identified in the Southern blot analysis, was pursued. 

Screening of a HindlH-generated library resulted in the isolation of a 
pUC18 clone (pSLl), containing a 1.8-kb insert. Further analysis showed 
that the homology could be delineated to a small 100-bp HindHI/Pvun 
fragment. Sequence analysis confirmed this homology: the deduced ammo 
acid sequence of this region completely matched the sequence as obtained 
after microsequencing of the SLP subunit. Preceding this region a typical 
signal peptide sequence for secretion was detected. The pSLl clone thus 
20 contains the sip. promoter and a stretch encoding a 30-residue signal peptide 
and the first 20 amino acid residues of the SUP of B , sphaencus P-l. 

It also became clear that the originally identified 5.0-kb EcoRI fragment 
indeed contained the complete sip gene, including its own promoter. 
However due to inability to clone this fragment in E^coU, we were obliged 
25 to isolate the gene as a set of overlapping clones. In this way, three other 
P UC18 clones were isolated from several libraries using the previous 
fragment as probe: pSL4, containing a 0.8-kb PvuH fragment; pSLlO, 
harbouring a 1.6-kb BglH fragment and pSL20 carrying a 3.0-kb Hindm 
fragment (Figure 5). In total a region of 4.6 kb was spanned from wmch 
30 the DNA sequence was determined (Figure 6). 

thp d p ppnp. sequence . Analysis of the DNA sequence starting from 
the EcoRI site to the most downstream Hindm site revealed an open reading 
frame (ORF) of 3756 nucleotides, starting at the ATG initiation codon 
(position 95 to 97; Figure 6) and terminating at the stop codon TGA 
(position 3851 to 3853; Figure 6), whereas as many as 256 translation stop 
codons are dispersed over the 2 other reading frames. This ORF could 
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encode a polypeptide of 1252 residues with a deduced molecular mass of 
130,060 Da, which is 20 kDa less than the value deduced from SDS-PAGE. 
This discrepancy is probably due to the fact that the B. sphaericus P-l SUP 
is glycosylated (Lewis et aJ, 1987). Indeed, 20 potential N-linked 
glycosylation sites are distributed over the sequence. The protein has a 
calculated pi value of 4.59, which is in accordance with the experimentally 
determined value of 4.6 ± 0.4 (Lewis et aj, 1987).- 

The 3' end of the ORF is followed by a palindrome with a stem of 13 
base pairs (position 3904 to 3933; Figure 6) and a thymidine-rich stretch, 
which is typical for a Rho-independent transcription termination signal 
(Piatt, 1986). The start codon ATG is preceded by a potential ribosome- 
binding site 5'-AGGGAGG-3' (position 78 to 85; Figure 6). The 11 
nucleotide-spacing between the middle A of this motif and the ATG codon 
is typical of that found in gram-positive bacteria (Hager and Rabinowitz, 
1985). 

The deduced amino acid sequence was analyzed by the computerized ' 
method of Kyte and Doolittle (1982) for hydropathicity (Figure 7). The 
NH 2 -terminal sequence (30 residues) appears to be very hydrophobic, which 
is in accordance with the presence of a signal peptide responsible for 
secretion. It moreover ends by the sequence VASA, a motif frequently 
recognised by signal peptidases (von Heijne, 1986) and is then directly 
followed by the sequence determined by microsequence analysis of the 
mature SLP subunit. Several other hydrophobic regions, which might 
interfere with membrane translocation were observed at the COOH-terminus 
(Figure 7). 

Table m shows the amino acid composition of the SLP, which shares 
several features with other S-layer proteins. The only sulphur-containing 
amino acid is methionine (2 residues). It contains a high proportion of 
hydrophobic amino acids (38%), but it is not very enriched in acidic 
residues (10.6%) versus basic residues (8.3%), as is the case for the two S- 
layer proteins of B. brevis 47 (Tsuboi et al, 1988). No significant 
homology to other S-Iayer proteins was found, except with the SLP of JL 
sphaericus 2362 (Bowditch et al, 1989), at the level of the NH 2 -terminal 
sequence (Figure 8). The first 200 residues show a degree of 82% identity. 
However, no homology downstream of this region could be detected. This 
observation was confirmed by comparing the hydropathicity plots of both 
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proteins (data not shown). 

Tablem 
Amino acid« H^^ 



Amino Acid 



Threonine 

Valine 

Alanine 

Lysine 

Asparagine 

Glutamic acid 

Leucine 

Aspartic acid 

Phenylalanine 

Serine 

Isoleucine 

Tyrosine 
Glycine 
Glutamine 
Proline 
Arginine 
Tryptophan 
Methionine 
Histidine 
Cysteine 



Number 


% (molecular mass) 


loo 


12.8 


134 


10.2 


181 


9.9 


91 


9.0 


oo 


7.7 


69 


6.9 


64 


5.6 


62 


5.5 


47 


5.3 


78 


5.2 


55 


4.8 


37 


4.6 


95 


4.2 


34 


3.4 


31 


2.3 


11 


1.3 


6 


0.9 


2 


0.2 


1 


0.1 


0 


0.0 



10 



xv^ promoter. In order to examine how the sip gene is ^nbed 
in vivo total RNA was isolated at different growth phases: early and middle 
ioga^hmic phase, early stationary phase and from an overnight saturated 
culture Northern blot analysis demonstrated the presence of one smgle 
transcript of approximately 4500 nucleotides. The sip gene is expressed at 
high level up to early stationary phase. However a sharp decrease >s 
observed in a saturated culture, together with a simultaneous drop » rRNA 
le vels due to stringent response (Cashel and Rudd, 1987) (Rgure 9). These 
nigh levels of expression during most of the bacterial growth cycle axe to be 
expected in view of the continuous need of large amounts of SLP subumts 
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for the assembly of an intact surface-layer, even at stationary growth phase 
when the SUP is released into the medium (Howard and Tipper, 1973). 

Primer extension assays were performed to study the existence of 
complex multiple promoters involved in regulation of sip gene expression, 
5 such as in the case of B. brevis 47 (Adachi et al, 1989). Using two 
different primers the 5' end(s) of the transcript were detected. Three 
different transcription initiation sites were identified in both experiments at 
positions -184 (PI), -340 (P2) and -385 (P3) with respect to the first 
nucleotide (+1) of the start codon (Figure 10). Each transcription start site 

10 was preceded by a potential -10 and -35 motif as indicated in Figure 10. 

Spacing between both motifs corresponded to the preferred internal length 
(16 to 18 bp) for B. subtilis promoters (Moran et aJ, 1982). 

g-Glucuronidase fusions to study sip gene expression . The sip 
promoter was fused to the /S-glucuronidase (uidA) reporter gene to examine 

15 the expression characteristics of this complex 5'-upstream region. Through 
PCR-mediated site-specific mutagenesis a Ncol site was generated at the 
ATG start codon of the sip gene. The promoter was then isolated as a 
Xbal/Ncol fragment and fused to the uidA ORF at the Ncol site in pSL150, 
yielding pSL151. 

20 Through the combined action of ExoIH/Sl nuclease a set of progressive 

deletions towards the ATG start codon was generated and introduced into 
pSL150 as Xbal/Ncol fragments. The exact end point of each deletion was 
determined by sequence analysis (Figure 11). This set of plasmids (pSL151 
to pSL159) was introduced into B. sphaericus P-l by elect rotransformation 

25 as described in Example 1 and /S-glucuronidase activity was monitored. 

The results are shown in Figure 11 and can be summarized as follows: 
deletions up to approximately position -150 are completely abolished in 
uidA expression. Indeed, according to the primer extension assay these 
mutants are devoid of any of the three identified promoters. Deletions 

30 removing sequences up to position 

-375 show a threefold increase in /3-glucuronidase activity as compared to 
pSL151. These constructs only contain promoter PI. All smaller deletions 
show again wild-type levels of /^-glucuronidase activity. In these mutants 
all three promoters are intact again. 

35 Effect of Ca 2 + on sip expression . In several cases it has been reported 

that Ca 2+ plays a key role in the assembly of the surface-layer on the 
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bacteria (Feraldo et aj, 1991 Yang et aj, 1992). Moreover, Adachi and co- 
workers (1991) observed that Ca 2+ repressed the expression of the cell wall 
protein gene operon of B. brevis 47. In this context the previously 
constructed PsI p-nidA fusions proved to be excellent tools to monitor the 
5 possible effect of Ca 2+ on sip gene expression. Bacteria were grown in LB 
medium supplemented with an overdose Ca 2+ (7.5 mM) and compared to 
cells grown in the absence of Ca 2+ . ^glucuronidase activity was measured 
4 hours after dilution of the cultures (1/100) and simultaneous addition of 
Ca 2+ to the medium. As can be seen in Figure 12, addition of Ca 2+ 

10 resulted in a two-fold reduction of 0- glucuronidase activity in all mutants up 
to position -440, whereas mutants containing only promoter PI were 
immune to this negative effect. These results suggest that the Ca 2+ 
repression is located at promoters P2 and/or P3. These observations were 
confirmed when assaying enzyme activity 24 hours after addition of Ca 2+ 

15 (data now shown). 
Example 3 

1. Materials and Methods 

R a rte.ria1 strain s and plasmids . Growth media and selective antibiotic 
concentrations for B. sphaericus P-l have been described in Example 1. 

20 The plasmid pSL64, used as a basis for the construction of the different 

intermediate vectors, was isolated by insertion of a promoterless npJS gene 
as a 1.12-kb BamHI/Sall fragment from pKm 109/2 (Reiss et aj, 1984a) into 
the bifunctional, erythromycin resistance (Em R ) encoding vector pSL40 of 
Example 1 (Figure 14A). The nucleotide sequence of the linker preceding 

25 the nptn gene in pSL64 is shown in Figure 14B1 . 

A similar plasmid (pSLlOl) was constructed by exchange of the 
BamHI/Ncol fragment of pSL64 for a similar sized fragment of pLKM92 to 
shift the reading frame of the nptn gene, compared to the multicloning site. 
pLKM92 is P Kml09/90 (Reiss et aj, 1984a) with a slightly modified 

30 polylinker. The nucleotide sequence of the linker preceding the nptn gene 
in pSLlOl is shown in Figure 14B2. 

pSL4, pSLlO, and pSL20 are subclones of the sip gene of Example 2 
and are indicated on Figure 15. Intermediate vectors, constructed in the 
course of this study are summarized in Table IV: 



- 41 - 



Table IV 
Intermediate Vectors 



Plasmid Name 



Cloned internal part of SLP* 



Cloning Vector 



pSL66 
pSL68 
pSL69 
pSL70 
pSL71 
pSL102 
pSLlll 
pSL113 



2048-2854 
1470-2476 
443-2053 
443-1609 

49-807 
443-2053 
443-2053 
2084-2854 



pSL64 
pSL64 
pSL64 
pSL64 
pSL64 
pSLlOl 
pSL64 b 
pSLlOl 



• bp 1 is the A base of the start codon; b containing the 554-bp Sau3A 
fragment encoding the central part of SI. 

Transformation. Competent E. coli MCI 061 strains were prepared and 
transformed according to Kushner (1978), whereas transformation of JL 
sphaericus P-l was achieved by electroportation as described in Example 1. 

Sacculi preparation. Native sacculi were prepared by a protocol 
described by Sara and Sleytr (1987) and modified for B. sphaericus P-l or : 
derivative strains were grown overnight at 37°C in TB medium (Tartof and 
Hobbs, 1987) on a gyratory shaker. Cells were harvested by centrifugation, 
resuspended in 50 mM Tris-HCl, pH 7.2 (50 ml per 100 g pellet), and 
sonicated for 1 min (40 Watt, using a Bransic Sonic Power Co. sonicator). 
Triton X-100 was added to a final concentration of 2%, and the mixture 
was incubated, with agitation, for 30 min at 50°C. Treated cells were 
collected by centrifugation (15,000 g, 10 min), and washed three times with 
cold, distilled H 2 0. The pellet was resuspended in 5 mM MgCl 2 , 
containing DNAse (5 ^g/ml) and RNAse (20 fig/ ml), and incubated for 15 
min at 37<>C. The resulting native sacculi were pelleted, washed three times 
with cold distilled H 2 0, and resuspended in 20 ml buffer (20 mM Tris-HCl, 
pH 7.2, 2.5 mM CaCl 2 , and 2 mM phenylmethylsulfonylfluoride). 

Enzymatic assays. Enzymatic assays were performed either on 
.trichloroacetic acid (TC A) -precipitated culture supernatants or on insoluble 
and soluble fractions of sonicated cells (4 times 10 sec, 40 W). NPTH 
activity was assayed by the in situ phosphorylation assay after separation of 
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the proteins on non-denaturing polyacrylamide gels (Reiss et al, 1984b). 
Nicotinamide adenine dinucleotide (NAD) glycohydrolase activity was 
measured as the release of "C-labeUed nicotinamide from [carbonyl- 
,4 C]NAD as described by Locht et al, (1987). 
5 sns-PAGE anri immunoblotting . Sodium dodecyl sulfate 

polyacrylamide gel electrophoresis and Western blotting were performed by 
standard procedures (Laemmli, 1979). The filters were blocked with 2% 
Tween 20 and incubated with an 1:1000 dilution of the specific rabbit (anti- 
NFTII) or goat (anti-pertussis toxin [FT]) serum, followed by alkaline 
10 phosphatase conjugated goat anti-rabbit or mouse anti-goat IgG (Bio-Rad) as 
described by the manufacturers. 

Tmm„no pft nicitv of Tttmmhinant barteria w ith composite Sjayersjn 
mice Groups of five female Balb-C mice were injected (intraperitoneal^ 
with different titers of recombinant P-l::pSLlll bacteria (10 7 , 10", and 10 
15 colony forming units (CFU)), corresponding to an estimated amount of 0. 1 , 
1 and 10 ng of SI subunit of pertussis toxin, either with or without 
Freund's adjuvant. Control experiments included purified SI subunit of FT, 
and recombinant P-l::pSL102 bacteria. Injections were repeated after 3 and 
9 weeks. At week 12, sera samples were collected. Mice sera were 
20 screened for the presence of antibodies directed against FT SI suburut in a 
sandwich ELISA. FT was used as antigen and captured by a Guinea pig 
antiserum. 

r^. n \ recombinant nNA techniques. These were according to 
Sambrook et al, (1989). Restriction enzymes were purchased from New 

25 England Biolabs, Pharmacia or Bethesda Research Laboratories and were 
used according to the manufacturers' recommendations. Elution of DNA 
fragments separated by agarose gel electrophoresis was done using Gene 
Clean (Trade Mark) kit (BiolOl Inc. La Jolla, California, US). Southern 
transfer and hybridization were performed using Hybond N + membranes 

30 (Amersham) and Quickprime (Trade Mark) labelling kit (Pharmacia) to 
prepare "P-labelled probes. 
2. Results 

Anal ysis of carhnw-te.rmina) deletions of B sphaericus P-l SLP. 
Sequence analysis of the 4.5-kb sip gene of Rjphaericus P-l revealed that 
35 the predicted first 200 amino acids of SLP were highly similar to the 

deduced sequence of the SLP of B. sphaericus 2362 (Bowditch et al, 1989), 
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whereas the remaining parts of the proteins were highly divergent. 
Potential transmembrane helices were predicted to be located near the 
carboxy-terminal ends of both proteins (Bowditch et a], 1989). Therefore, 
these conserved motifs might be important in the build-up or anchoring of 
5 theS-layer. The fraction of the non-homologous COOH-terminal part of IL 
sphaericus P-l SLP that could be removed without notable interference with 
S-layer assembly was determined by progressive deletions. 

Therefore, several intermediate vectors were constructed (based on the 
bifunctional vector pSL64; Table IV) that contained different internal 
10 fragments of the sip gene, cloned upstream of a promoterless nptn gene 

(pSL66, pSL68, pSL69, pSL70, pSL71; Table IV and Figure 15A). These 
constructs were introduced into B. sphaericus P-l by electroporation and 
selection for Em R transform ants. Neomycin-resistant (Nm R ) colonies, and. 
thus putative single homologous recombinants, were obtained for the P-l 
15 strains transformed by almost all intermediate vectors, except for pSL71 

which contained the most amino-terminally located fragment of the sip gene. 

Southern analysis revealed the patterns expected for single homologous 
recombination (see Figure 16 for pSL69 integration). Culture supematants, 
insoluble cell debris, and soluble cell contents after sonication of the strains 
20 carrying the different carboxy-terminal deletions were analyzed by SDS- 
PAGE electrophoresis (Figure 15B). Abundant amounts of proteins with 
the expected molecular masses (see Figure 15 A) could be readily observed 
in the insoluble cell fraction. In contrast with the rather abundant presence 
of SLP in culture supematants of wild-type P-l, no such proteins were 
25 observed in culture supernatant of strains expressing truncated SLPs, except 
for P-l::pSL69 (data not shown for P-l::pSL70). These data suggest that, 
whereas the carboxy-terminal part of the SLP and in particular residues 
numbered 536 to 1252 in Figure 6 are dispensable, the amino- terminal part 
and especially residues numbered 31 to 269 in Figure 6 are absolutely 
30 required for viability of P-l cells. Residues numbered 31 to 269 in Figure 
6 constitute the N-terminal 239 or 19.56% residues of the mature SLP. 

Translational fusion of reporte r proteins to the carhoxv-terminn<; of 
truncated SLPs. To determine whether the deletable part of SLP can be 
replaced by a protein of interest, we fused reporter proteins NPTH and the 
35 soluble fragment of the subunit SI of toxin produced by Bordetella pertussis 
(PT) to the carboxy-terminus of truncated SLPs. To achieve this, a similar 
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strategy as described above was used with the modification that the 
intermediate vectors now contained an internal sjp. part, translationally fused 
to either nptn (pSL102, pSLl 13; Table IV) or PT in fusion with OTTO 
(pSLlll; Table IV). Figure 17 summarises the expected fusion proteins 
5 generated by the different intermediate vectors. Nm R colonies, putative 

integrants, were selected after introduction of these intermediate vectors into 
P-l by electroporation and selection of Em* transformants. Southern 
hybridizations of Bg IE-digested total DNA of these candidates with 32 P- 
labelled pSL20 (containing a 4-kb BglH fragment of the sip gene; Figure 
10 15A), revealed the expected patterns after single homologous recombination 
through the cloned sip part (Figure 16). 

In SDS-PAGE analysis of total protein extracts of recombinant strains 
P-l::pSL69, P-l::pSL102 and P-l::pSL113 (Figure 18 A), major protein 
bands were observed at 74 kDa, 102 kDa and 130 kDa, respectively, the 
15 sizes expected for the truncated SLP or the two different fusion proteins. In 
total protein extracts of P-l::pSLlll (Figure 18C), however, only a faint 
protein band of the expected size (120 kDa) could be observed. The fusion 
SLPs fractionated predominantly to the cell debris after sonication, as was 
also observed for the truncated SLPs. 
20 Western blottings of similar SDS-PAGE gels were challenged with anti- 

NPTH or anti-PT antibodies. Anti-NPTH reacted only with the 102-kDa 
and 130-kDa proteins in P-l::pSL102 and P-l::pSL113 protein extracts, 
respectively (Figure 18C), whereas in PlrtpSLlll extracts, the 120-kDa 
protein was revealed (data not shown). No cross-reaction was observed 
25 with proteins from P-l::pSL69. Anti-PT detected two specific proteins in 
P-l::pSLlU (120 kDa and 90 kDa), and four specific low-molecular mass 
proteins, that were also revealed in P-l::pSL69 extracts (Figure 18D). 
Signals with both anti-PT and anti-NPTH were significantly enhanced when 
using proteins from native sacculi preparations (see below). 
30 Reporter protein* fused to SLP retain their enzymatic activity. Because 

both proteins used as reporters in this study exhibit enzymatic activity that 
can be relatively quickly assayed, we determined whether the fusion 
proteins retained these catalytic abilities. 

Kanamycin phosphorylation activity of fusion proteins was determined 
35 by the m gel assay, using either TCA-precipitated culture supematants, or 
cell debris and soluble fraction after sonication. Significant phosphorylation 
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activity was observed in all recombinant strains (but not in P-l or in P- 
l::pSL69 extracts) and was confined almost exclusively to the insoluble cell 
debris fraction after sonication (Figure 19). 

NAD-glycohydrolase activity, specified by the SI subunit of the 
pertussis toxin, was determined on TCA-precipitated culture supernatant, or 
cell debris and soluble fraction after sonication of the recombinant strain P- 
l::pSLlll (Table V), in comparison to a calibration curve using purified 
PT toxin. Again, significant enzymatic activities were only detected in 
cellular debris fraction of P-l::pSLlll. The apparently high, a specific 
hydrolase activity detected in the supernatant of both P-l::pSLlll and P- 
l::pSL69 is due to acid hydrolysis caused by TCA residues from the 
precipitation (data now shown). 

Table V 



NAD-glvcohvdrolase activities of recombinant P-l strains 



Protein Source 


Released C14-nicotinamide (cmp) 


PT toxin (fig/ml) 




0 


1860 


1 


5350 


5 


10520 


10 


14810 


20 


21850 


40 


26050 


P-l::pSL69 




cell debris 


1340 


sonicate 


1480 


P-l::pSLlll 




cell debris 


5380 


sonicate 


1530 



Carboxv-terminal fu sions to SLP assemble in a functional S-laver . To 
address the question whether fusion proteins between truncated SLP and 
NPTTI assemble into a S-layer, intact bacteria were immunogold-labelled 
using anti-NPTII antibodies. Significant accumulation of label on the 
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10 



15 



bacteria was observed with P-l::pSL102 and to a lesser extent with P- 
l::pSL113 (Figures 20A and 20B). No background label could be found 
using either P-l or P-l::pSL69 (which contains an intracellular NPITI 
protein). 

Circumstantial evidence for the assembly of the fusion proteins into an 
S-layer, comes from the preparation of ghost cells, or native sacculi as 
described by Sara and Sleytr (1987), in which cytoplasm and membrane are 
removed without affecting the structural integrity of the peptide-glycan layer 
and the S-layer. Application of this protocol to recombinant P-l::pSL102, 
and P-l::pSLlll strains resulted in sacculi preparations, significantly 
enriched in fusion S-layer protein (Figure 21A). Western blots, challenged 
with anti-NPTEI or anti-PT detected readily the fusion proteins (Figures 2 IB 
and 21C). 

The immunogenicity of the composite S-layers was determined by 
injection of the recombinant bacteria P-l::pSLlll, expressing the SI 
subunit of PT fused to SLP, along with purified SI and recombinant P- 
l::pSL102 bacteria as controls. SI antibody titers were determined after 12 
weeks by EOS A (Table VI). A significant higher amount of Sl- 
recognizing antibodies were detected in blood samples of the groups of mice 
injected by the highest concentration of the recombinant bacteria expressing 
the SI -subunit in a composite S-layer (P-l::pSLlll). 
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Table VI 

Immunogenicity of recombinant P-l::pSLlll in Balb/c mice 





Antigen 
G*g) 


Dilution l.zU 


.Dilution x.ou 


SI - 


0.1 
1 

10 


19.7 
426.1 
465.5 


14.9 
402.1 
460.8 


SI + AF 


0.1 

1 

10 


132.5 
418.2 
457.8 


56.0 
432.4 
453.7 


min 


0.1 

1 

10 


49.5 ] 
39.8 
238.7 


2.2 
1.9 
116.8 


mill + af 


0.1 

1 

10 


160.9 
158.8 
354.3 


* 75.7 
70.3 
210.8 


mi02 


0.1 

1 

10 | 


34.5 
43.9 
67.6 


3.8 
1.7 
6.5 


mi02 + AF 


0.1 

1 

10 


117.0 
151.1 
160.1 


16.8 
57.4 
65.2 



Values are geometric means of FJJSA titer readings of five independent 
injections. AF, Freund's adjuvant; mill, intact bacteria P-l::pSLlll; 
IB 102, intact bacteria P-l::pSL102. 

Table VH 

5 Detailed information on the construction of intennediate 

vectors for the construction of truncated SLP's and 
composite SLP's. 
Note on numbering: base 1 = A of ATG of the slg gene 

(= mRNA numbering 4- 95) 
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Sim ple SLP clones 

pSL4 = 1536 bp Hindm fragment in pUC18 HindHI. This fragment 
contains the complete s]p_ promoter and sip gene portion 1 to 
142. 

pSL5 = sip. gene portion 49-807 (PvuH-PvuH) in pUC18 Smal. 
pSLlO = sip gene portion 443-2053 (BglH-BgUD in pUC18 BarnHI. 
pSL20 = 3048 bp HindHI fragment in pUC18 HindHI. This fragment 

contains the sip. gene portion 1049 to end of ORF (3759) and the 
transcription termination signal. 

Truncated SLP intermediate vectors 



Name 


Cloned sip 
part 


Restriction 
site and clone 


Site in 
pSL64 


pSL66 


2048-2854 


pSL20 BglH 


BarnHI 


pSL68 


1470-2476 


pSL20 PvuH 


BcoRV 


pSL69 


443-2053 


pSLlO Xbal/EcoPJ 


Xbal/EcoRI 


pSL70 


443-1609 


pSLlO Xbal/Hpal 


Xbal/EcoRV 


pSL71 


49-807 


pSL5 Xbal/EcoRI 


Xbal/EcoRI 



10 Com posite SLP interm ediate vectors 

pSL109: 554 bp Sau3A fragment cloned in BarnHI site of pSL64 and 

selection of the correct orientation. 
P SL102: EcoRI-Bgim fragment of pSL20 (1049-2053) in BarnHI site of 

pSLlOl. 

15 pSLl 13: BglH fragment of pSL20 (2048-2854) in BarnHI site of pSLlOl . 
pSLl 1 1 : Same as pSL102 in BarnHI site of pSL109 . 
pSL40 = 7453 bp high copy number bifunctional vector, Example 1 and 
Figure 4A. 

pSL84 = 6679 bp low-copy number bifunctional vector, Example 1 and 
20 Figure 4B. 

pSLl 50 = promoterless /3-glucuronidase gene isolated as 2558 bp 

Bamm/Sphl fragment and cloned in pSL40 BamHI/SphI, 

Example 2. 

pSL15 1-159 = Xbal/Ncol fragments carrying progressive 
25 deletions of the sip promoter into pSL150 



WO 95/19371 




PCT/EP95/00147 



Xbal/Ncol. Exact end points of the deletions are 
indicated in Example 2 and Figure 10. Ncol site 
coincides with ATG start codon. 
pSL64 = 1.12 kb BamHI/SaU fragment, carrying promoterless nptll gene 
5 from pKm 109/2 into pSL40 BamHI/Sall, Example 3 and Figure 

14A. 

pSLlOl = essentially the same as pSL64, but having another DNA linker 
in front of the nptn gene, Example 3 and Figure 14B. 
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CLAIMS 

1 - A host cell which is provided with a S-layer comprising a fusion 
polypeptide consisting essentially of : 

(a) at least sufficient of a S-layer protein for a S-layer composed thereof to 
assemble, and 

(b) a heterologous polypeptide which is fused to either the carboxy 
terminus of (a) or the amino terminus of (a) and which is thereby 
presented on the outer surface of the said cell. 

2 - A cell according to claim 1 which is a bacterium of the genus 
Bacillus . 

3 - A cell according to claim 2 wherein the bacterium is B. sphaericus 
P-l (LMG P-13855). 

£ 

4 - A cell according to any one of the preceding claims, wherein the / 
heterologous polypeptide is fused to either the carboxy terminus or the 
amino terminus of the most N-terminal 41 % or more amino acid residues of 
a S-layer protein. 

5 - A cell according to any one of the preceding claims, wherein the S- 
layer protein is derived from B. sphaericus . 

6 - A cell according to any one of the preceding claims, wherein the 
heterologous polypeptide is an antigenic peptide. 

7 - A cell according to claim 6, wherein the heterologous polypeptide 
comprises an antigenic determinant of a pathogen selected from a virus, 
bacterium, fungus, yeast and parasite. 

8 - A cell according to claim 6, wherein the heterologous polypeptide is 
selected from the P69 antigen of Bordetella pertussis , pertussis toxin or a 
subunit thereof, tetanus toxin fragment C, E. coli heat labile toxin B subunit 
and an E. coli K88 antigen. 



9 - Sacculi derived from a host cell as claimed in any one of the 
preceding claims. 
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10 - A pharmaceutical or veterinary composition comprising a host cell 
as claimed in any one of claims 1 to 8 or sacculi as claimed in claim 9 and 
a pharmaceutical^ or veterinarily acceptable carrier or diluent. 

11 - A vaccine comprising a host cell as claimed in any one of claims 6 
5 to 8 or sacculi as claimed in claim 9 when dependent on any one of claims 

6 to 8 and an acceptable carrier or diluent therefor. 

12 - A recombinant DNA molecule which comprises a promoter 
operably linked to a coding sequence which encodes a signal peptide and a 
fusion polypeptide, the signal peptide being capable of directing the said 

10 fusion polypeptide to be presented on the surface of a host cell in which 
expression occurs and the fusion polypeptide consisting essentially of a 
heterologous polypeptide fused to either the carboxy terminus or the amino 
terminus of at least sufficient of a S-layer protein for a S-layer composed 
thereof to assemble. 

15 13 - A molecule according to claim 12, wherein the promoter is a 

promoter for a S-layer protein from a Bacillus bacterium. 

14 - A molecule according to claim 13, wherein the promoter is the PI 
promoter nf B. sphaericus P-l (LMG P- 13855). 

15 - A molecule according to any one of claims 12 to 14, wherein the 
20 signal peptide is the signal peptide for the S-layer protein of which an 

appropriate portion is incorporated in the fusion polypeptide. 

16 - A molecule according to any one of claims 12 to 15, wherein the 
signal peptide is a signal peptide for a S-layer protein of a Bacillus 
bacterium. 

25 17 - A molecule according to any one of claims 12 to 16 which is an 

expression vector. 

18 - A molecule according to claim 17, wherein the vector is a plasmid. 



19 - A host cell which incorporates a recombinant DNA molecule as 
claimed in any one of claims 12 to 16. 



95/19371 




PCT/EP95/00147 



20 - A host cell according to claim 19 which has been transformed with 
a vector as claimed in claim 17 or 18. 

21 - A cell according to claim 19 or 20 which is a gram-positive 
bacterium. 

22 - A cell according to claim 19 or 20 which is a bacterium of the 
genus Bacillus . 

23 - A cell according to claim 22 wherein the bacterium is R 
SEhaericus P-l (LMG P-13855). 

24 - A process for the preparation of a host cell provided with a S-layer 
comprising a fusion polypeptide, which process comprises: 

(i) providing a suitable host cell incorporating a recombinant DNA 
molecule which comprises a promoter operably linked to a coding 4 
sequence which encodes a signal peptide and a fusion polypeptide, the 
signal peptide being capable of directing the said fusion polypeptide to 
be presented on the surface of the said host cell and the fiision 
polypeptide consisting essentially of a heterologous polypeptide fused to 
either the caiboxy terminus or the amino terminus of at least sufficient, 
of a S-layer protein for a S-layer composed thereof to assemble; and 

(ii) cultuiing the said host cell so that the said fusion polypeptide is 
expressed and a S-layer comprising the fusion polypeptide is formed on 
the surface of the said host cell, the heterologous polypeptide thereby 
being presented on the outer surface of the said host cell. 

25 - A process according to claim 24, which comprises : 

(a) providing an intermediate vector in which the coding sequence of an 
internal portion of the native S-layer protein of the said host cell has 
translationally fused to the 3' -end thereof the coding sequence for the 
heterologous polypeptide and in which the said coding sequences are 
provided upstream of a promotorless selectable marker gene such that 
they form a translational or transcriptional fusion therewith; 

(b) transforming the said host cell with the intermediate vector; 

(c) selecting a transformed host cell which has a S-layer comprising the 
said fusion polypeptide. 
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(C) 



(d) 



26 - A process according to claim 24 which comprises : 

(a) fusing to a promoter a S-layer protein coding sequence coding for the 
signal peptide and at least sufficient of the amino-terminal portion of a 
S-layer protein for a S-layer composed thereof to assemble on the 

5 surface of the host cell, and fusing a peptide coding sequence coding 

for the heterologous polypeptide to the 3'-end of the S-layer protein 
coding sequence, whereby a recombinant DNA molecule for the 
expression and presentation of the fusion polypeptide is prepared; 

(b) inserting the recombinant DNA molecule into a suitable vector, 
10 whereby a recombinant DNA vector is prepared; 

transforming a suitable host cell with the recombinant DNA vector, 
whereby a transformed host cell having the recombinant DNA molecule 
is provided; 

culturing the transformed host cell, whereby the fusion polypeptide is 
15 expressed and a S-layer comprising the fusion polypeptide is assembled 

on the host cell wall. 

27 - A promoter having a -35 region of the sequence TTGAAT and a - 
10 region of the sequence TAT ATT. 

28 - A promoter according to claim 27, having the sequence 

20 CTAAATTTATGTCCCAATGCTTGAATTTCGGAAAAGATAGTGTTAT 

ATTATTGT. 

29 - A promoter having a -35 region of the sequence CTTGGTT and a 
-10 region of the sequence TATAAT. 

30 - A promoter according to claim 29, having the sequence 

25 TCCAGAAAATGCTTGGTTATTATTGAGAGTAAGGTATAATAGGTA. 

31 - A promoter having a -35 region of the sequence ATTACGGGA 
and a -10 region of the sequence TTTAGT. 

32 - A promoter according to claim 31, having the sequence 
AAAATATTACGGGAGTCTTTAATTTTTGACAATTTAGTAACCAT. 

30 33 - A promoter according to any one of claims 27 to 32, having the 

sequence from nucleotide 52 to 353 shown in Figure 10. 
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34 - An expression vector which comprises a promoter as defined in 
any one of claims 27 to 33 and a downstream cloning site into which a 
DNA sequence encoding a heterologous protein may be cloned such that the 
promoter is operably linked to the said sequence. 

35 - An expression vector which comprises a promoter as defined in 
any one of claims 27 to 33 operably linked to a DNA sequence encoding a 
heterologous protein. 

36 - A DNA fragment comprising a promoter according to any one of 
claims 27 to 33 operably linked to a DNA sequence encoding a heterologous 
protein. 

37 - A host cell transformed with an expression vector as defined in 
claim 35. 

38 - A process for the preparation of a heterologous protein, which 
process comprises culturing a transformed host cell according to claim 37 
and obtaining the heterologous protein thus expressed. 

39 - A pharmaceutical or veterinary composition comprising a 
pharmaceutically or veterinarily acceptable carrier or diluent and, as active 
ingredient, a physiologically active heterologous protein which has been 
obtained by the process of claim 38. 

40 - A process of transforming B. sphaericus P-l cells with DNA, 
which process comprises harvesting B. sphaericus P-l cells at the late 
stationary growth phase, mixing the harvested cells with the DNA and 
effecting electroporation to cause entry of the DNA into the said cells. 

41 - Use of a host cell as claimed in any one of claims 1 to 8 for 
immobilisation purposes. 

42 - Use of a host cell as claimed in any one of claims 1 to 8 for 
screening purposes. 
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FIG. 13 
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. FIGURE 6 (Fig. 6a-6i) 

FIGURE 6a 

GAAAGCTATA ATACATACAT TTAGGTAACT AGGCGGTACT ATAGTTTTCG TTGGATTAAT 

ATCAATTTAA GGAATTTTAG GGAGGAATAC ATTA ATG GCA AAG CAA AAC AAA 

Met Ala Lys Gin Asn Lys 

GGC CGT AAG TTC TTC GCG GCA TCA GCA ACA GCT GCA TTA GTT GCA TCG 
Gly Arg Lys Phe Phe Ala Ala Ser Ala Thr Ala Ala Leu Val Ala Ser 

GCA ATC GTA CCT GTA GCA TCT GCT GCA CAA GTA AAC GAC TAT AAC AAA 
Ala lie Val Pro Val Ala Ser- Ala Ala ...Gin .. Val ...Asn Asp Tyr Asn Lys 

ATC TCT GGA TAC GCT AAA GAA GCA GTT CAA GCT TTA GTT GAC CAA GGC 
Ile.Ser.GlY.Tyr Ala Lys..G^ Asp Gin Gly 

GTA ATC CAA GGT GAT ACT AAC GGG AAC TTC AAC CCA CTT AAC ACA GTA 
Val He Gin Gly Asp Thr Asn Gly Asn Phe Asn Pro Leu Asn Thr Val 

ACT CGT GCA CAA GCT GCA GAA ATC TTC ACA AAA GCT TTA GAA TTA GAA 
Thr Arg Ala Gin Ala Ala Glu He Phe Thr Lys Ala Leu Glu Leu Glu 

GCT AAC GGA GAT GTA AAC TTC AAA GAC GTG AAA GCT GGC GCT TGG TAC 
Ala Asn Gly Asp Val Asn Phe Lys Asp Val Lys Ala Gly Ala Trp Tyr 

TAC AAC TCA ATC GCT GCT GTT GTA GCT AAC GGC ATT TTT GAA GGT GTT 
Tyr Asn Ser He Ala Ala Val Val Ala Asn Gly lie Phe Glu Gly Val 

AGT GCA ACT GAA. TTT GCA CCA AAC AAA TCT TTA ACT CGT TCT GAA GCT 
Ser Ala Thr Glu Phe Ala Pro Asn Lys Ser Leu Thr Arg Ser Glu Ala 
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GCT AAA ATT TTA 
Ala Lys lie Leu 

AGC GAA TTT GCT 
Ser Glu Phe Ala 

TTA GAA ATC GCA 
Leu Glu lie Ala 



GTA GAA GCA TTC 
Val Glu Ala Phe 

GAC GCT TCT CAA 
Asp Ala Ser Gin 

GTA GCT AAC GGC 
Val Ala Asn Giy 
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GGT TTA GAA GGT 
Gly Leu Glu Gly 

GTA AAA CCT TGG 
Val Lys Pre Trp 

ATT TTC GAA GGT 
He Phe Glu Gly 
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FIGURE 6b 

GAA GCA GAT CTT 
Glu Ala Asp Leu 

GCT AAA AAA TAG 
Ala Lys Lys Tyr 

ACT GAT GCA AAC 
Thr Asp Ala Asn 



GAA GAC TTT GCA CTA GTCz 
Gin Asp Phe Ala Leu Val 



TTC AAA CGT ACA GTT GAC AAA GTT GAA GGT GAA ACT CCA GAA GAA GCA 
Phe Lys Arg Thr Val Asp Lys Val Glu Gly Glu Thr Pro Glu Glu Ala 

GCA TTT GTT AAA GCT ATC AAC AAC ACA ACT GTT GAA GTA ACA TTC GAA 
Ala Phe Val Lys Ala He Asn Asn Thr Thr Val Glu Val Thr Phe Glu 



AAA CTT AAC CCT AAC AAC TCA ATC ACT CC-i 
Lys Leu Asn Pro Asn Asn Ser He Thr Arg 



GAA GAA GTT ACT AAC GTT CAA GCA 
Glu Glu Val Thr Asn Val Gin Ala 

GAA ATT AAA AAT GCT TCT GTT AAA 
Glu He Lvs Asr. .Ala Ser Val Lys 



CTT AAC TTC AAA ATC GAA GGT TTA 
Leu Asn Phe Lys He Glu Gly Leu 

CAA ACA -AAC AAA AAA GTT GTT GTA 
Gin Thr A s n L v s L y s Y a 1 Y a 1 Val 



TTA ACT ACT GAA GCT CAA ACA GCT GAT AAA GAG TAT GTT TTA AC i C x . 
Leu Thr Thr Giu Ala Gir. Thr Ale. Asp Lys Glu Tyr Val Leu Thr Leu 



G*C GGC GAA ACA ATC GCT GCT TTT AAA GGT GTG GCT GCT GTA GTT CCA 

?*-d Gly Glu Thr lie Glv Gly ?he Lys Gly Val Ala Ala Val Val Pre 
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FIGURE 6 



ACT AAA GTT GAA CTA GTA TCT TCT GCA GTT CAA GGT AAA CTT GGT CAA 
Thr Lys Val Glu Leu Val Ser Ser Ala Val Gin Gly Lys Leu Gly Gin 

GAA GTA AAA GTT CAA GCT AAA GTA ACT GTT GCT GAA GGT CAA TCT AAA 
Glu Val Lys Val Gin Ala Lys Val Thr Val Ala Glu Gly Gin Ser Lys 

GCT GGT ATT CCT GTT ACT TTC ACT GTA CCA GGT. AAC AAC AAT GAT GGC 
Ala Gly lie Pro Val Thr Phe Thr Val Pro Gly Asn Asn Asn Asp Gly 

GTT GTA CCA ACA TTA ACA GGT GAA GCT TTA ACA AAC GAA GAG GGT ATC 
Val Val Pro Thr Leu Thr Gly Glu Ala Leu Thr Asn Glu Glu Gly He 

GCA ACA TAC TCT TAC ACT CGT TAT AAA. GAA GGT ACT GAT GAA GTA ACT 
Ala Thr Tyr Ser Tyr Thr Arg Tyr Lys Glu Gly Thr Asp Glu Val Thr 

GCT TAT GCA ACT GGT GAT CGT TCT AAA TTC TCA CTT GGT TAT GTA TTC 
Ala Tyr Ala Thr Gly Asp Arg Ser Lys Phe Ser Leu Gly Tyr Val Phe 

TGG GGT GTA GAT ACA ATT CTT TCA GTT GAA GAA GTA ACT ACA GGT GCT 
Trp Gly Val Asp Thr He Leu Ser Val Glu Glu Val Thr Thr Gly Ala 

TCA GTT A^T AAT GGT GCA AAC AAA ACT TAC AAA GTT ACT TAT AAA A^.C 
Ser Val Asn Asn Gly Ala Asn Lys Thr Tyr Lys Val Thr Tyr Lys Asr. 

CCT AAA AC T GGT AAA CCA GAB GCA AAC AAA ACA TTT AAT GTT GGT TTT 
Pro Lys Thr Gly Lys Pro Glu Ala Asn Lys Thr Phe Asn Val Gly Phe 
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FIGURE 6d 



GTA GAA AAC ATG AAT GTT ACT TCT GAT AAA GTA GCA AAT GCT ACA GTT 
Val Glu Asn Met Asn Val Thr Ser Asp Lys Val Ala Asn Ala Thr Val 

AAT GGC GTA AAA GCA TTA CAA TTA AGC AAT GGT ACA GCT TTA GAG GCT 
Asn Gly Val Lys Ala Leu Gin Leu Ser Asn Gly Thr Ala Leu Asp Ala 

GCT CAA ATT ACA ACA GAT TCT AAA GGT GAA GCT ACA TTC ACA GTT TCT 
Ala Gin lie Thr Thr Asp Ser Lys Gly Glu Ala Thr Phe Thr Val Ser 

GGT ACT AAT GCA GCT GTA ACG CCA GTA GTA TAT GAT CTA CAC AGC ACT 
Gly Thr Asn Ala Ala Val Thr Pro Val Val Tyr Asp Leu His Ser Thr 

AAC AAT AGT ACT TCA AAT AAA AAA TAT ACT GCA TCT GCT TTA CAA ACT 
Asn Asn Ser Thr Ser Asn Lys Lys Tyr Ser Ala Ser Ala Leu Gin Thr 



ACT GCT TCT AAA GTA ACT TTC GCT GCT CTT CAA GCA GAG TAT ACA ATT 
Thr Ala Ser Lys Val Thr Phe Ala Ala Leu Gin Ala Glu Tyr Thr He 

GAG TTA ACT CGT GCT GAT AAT GCT GGA GAA GTT GCT GCA ATT GGC GCT 
Glu Leu Thr Arc Ala Asp Asn Ala Gly Glu Val Ala Ala He Gly Ala 

ACT AAC GGT CGC GAA TAG AAA GTT ATT GTA AAA GAT AAA GCT GGT AAC 
Thr Asn Gly Arc Glu Tyr Lys Val lie Val Lys Asp Lys Ala Gly Asn 

TTA GCT AAA AAT GAA ATC GTT AAT GTT GCA TTC AAT GAA GAT AAA GAT 
Leu Ala Lys Asn Glu He Val Asn Val Ala Phe Asn Glu Asp Lys Asp 
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FIGURE 



CGT GTA ATT TCA ACA.GTT ACA AAT 
Arg Val lie Ser Thr Val Thr Asn 

GAT ACT GCA GTA TAC TTC ACA GGC 
Asp Thr Ala Val Tyr Phe Thr Gly 

AAA ACA AAT GAT AAA GGT GAA GCT 
Lys Thr Asn Asp Lys Gly Glu Ala 

GTA AAC GAT TAT GCA ACA CCA ATT 
Val Asn Asp Tyr Ala" Thr Pro lie 

GAT GCA AAA CAA GGC GAC CTT GAT 
Asp Ala Lys Gin Gly Asp Leu Asp 

CCA ATC TCT TAC TTC CAA GCA CCA 
Pro He Ser Tyr Phe Gin Ala Pro 

GCA TAC AAA AAA TCA GAT CTT AAT 
Ala Tyr Lys Lys Ser Asp Leu Asn 

TCT GAA ACT GCA GTA TTT GCA GCA 
Ser Glu Thr Ala Val Phe Ala Ala 

AAA GTA ACT GGT ACT TCT ATT AAG 
Lys Val Thr Gly Thr Ser lie Lys 



GCT AAA TTC GTT GAT ACT GAT CCA 
Ala Lys Phe Val Asp Thr Asp Pro 

GAT AAA GCA AAA CAA ATC TCT GTA 
Asp Lys Ala Lys Gin He Ser Val 

ACA TTT GTT ATC GGT TCT GAT ACA 
Thr Phe Val lie Gly Ser Asp Thr 

GCT TGG ATT GAT ATT AAT ACT TCT 
Ala Trp He Asp He Asn Thr Ser 

GAA GGT GAA CCA AAA GCA GTT GCA 
Glu Gly Glu Pro Lys Ala Val Ala 

TAT CTT GAT GGC TCA GCT ATC AAA 
Tyr Leu Asp Gly Ser Ala He Lys 

AAA GCT GTA ACT AAG TTT GAT GGT 
Lys Ala Val Thr Lys Phe Asp Gly 

GAA TTA GTA AAC CAA AGC GGC AAA 
Glu Leu Val Asn Gin Ser Gly Lys 

AAA GCA ACT TAT ACA ATC TAC AAT 
Lys Ala Thr Tyr Thr He Tyr Asn 
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ACT GGT GCT AAT GAT ATT AAA GTA GAT AAC CAA GTT ATC TCA CCA AAT 
Thr Gly Ala Asn Asp lie Lys Val Asp Asn Gin Val He Ser Pro Asn 

CGT AGC TAC ACA GTA ACT TAT GAA GCT ACT TTA TCT TCT ACA GGA ACT 
Arq ser Tyr Thr Val Thr Tyr Glu Ala Thr Leu Ser Ser Thr Gly Thr 

GTT ATT ACA CCT GCT AAG AAT TTA GAA GTT ACT TCA GTG GAT GGT AAA 
Val He Thr Pro Ala Lys Asn Leu Glu Val Thr Ser Val Asp Gly Lys 



ACA ACT GCT GTT AAA GTA ATT GCT ACA GGT ATT GCT GTT AAT ACA GAC 
Thr 



Thr Ala Val Lys Val He Ala Thr Gly He Ala Val Asn Thr Asp 



GGT AAA GAC TAT GCA TTT ACT GCT AAA GAA GCT ACA GCT ACA TTC ACA 
Gly Lys Asp Tyr Ala Phe Thr Ala Lys Glu Ala Thr Ala Thr Phe Thr 

GCT ACA AAT GAA GTT CCA AAC TCT TAC ACT GGT GTA GCT ACT CAA TTC 
Ala Thr Asn Glu Val Pro Asn Ser Tyr Thr Gly Val Ala Thr Gin Phe 

AAT ACA GCT GAT TCT GGT TCA AAC AGC AAC TCT ATT TGG TTT GCT GGT 
Asn Thr Ala Asp Ser Gly Ser Asn Ser Asn Ser He Trp Phe Ala Gly 

AAA AAC CCA GTG AAA TAT GCT GGT GTA TCA GGC AAA ACA TAT AAA TAC 
Lys Asn Pro Val Lys Tyr Ala Gly Val Ser Gly Lys Thr Tyr Lys Tyr 

TTC GGA GCT AAT GGT AAT GAA GTA TTT GGT GAA GCG GCA TGG GAA GCA 
Phe Gly Ala Asn Gly Asn Glu Val Phe Gly Glu Ala Ala Trp Glu Ala 
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FIGURE 6g 



TTA TTA ACT CAA TAT GCA ACT GAA GGC CAA AAA GTA ACA ATC TCA TAT 
Leu Leu Thr Gin Tyr Ala Thr Glu Gly Gin Lys Val Thr lie Ser Tyr 

AAT GTA GAT GGT GAT ACA GTT ACA TTT AAA GTA ATT AGT GCT GTT AAT 
Asn Val Asp Gly Asp Thr Val Thr Phe Lys Val lie Ser Ala Val Asn 

TCT TCA ACT GAA GCT ATC AAA CCA GTT GCT CCA ACA ACA CCA GCA GCT 
Ser Ser Thr Glu Ala He Lys Pro Val Ala Pro Thr Thr Pro Aia Ala 

CCA ACT ACT GGC GCA TTA ACA TTA ACA CCA GCA GCT GGT GGT TTA GTT 
Pro Thr Thr Gly Ala Leu Thr Leu Thr Pro Ala Ala Gly Gly Leu Val 

GAT TTA ACA ACT GCA ACT AAC ACT TTA GGA ATT TCA TTA GCT GAT GCA 
Asp Leu Thr Thr Ala Thr Asn Thr Leu Gly He Ser Leu Ala Asp Ala 

GAT CTT AAT GTA AGT GCA ACA ACT GTT GAT ACT GCA ACT GTT TCA TTA 
Asp Leu Asn Val Ser Ala Thr Thr Val Asp Thr Ala Thr Val Ser Leu 

AAA GAT AGT GCA AAT AAT TCA TTA TCT CTT ACA TTA GTT GAA ACT GGT 
Lys Asp Ser Ala Asn Asn Ser Leu Ser Leu Thr Leu Val Glu Thr Gly 

GCT AAT ACA GGT GTA TTT GCT ACA ACT GTT CAA GCT GGT ACA TTA TCT 
Ala Asn Thr Gly Val Phe Ala Thr Thr Val Gin Ala Gly Thr Leu Ser 

TCT TTA ACT GCT GGT ACA TTA ACA GTT ACT TAT GCA GAT GCT AAA AAT 
Ser Leu Thr Ala Gly Thr Leu Thr Val Thr Tyr Ala Asp Ala Lys Asn 
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GCT GCA GGT GTT GCT GAA AAT ATI ACT GCT AGC GTA ACA TTA AAG AAA 
M. Ala Gly val Ala Gl» A=n He Thr Ala Ser Val Thr Leu Lys Lys 

Thr Thr Gly Ala He Thr Ser Asp Thr Phe Thr Gin Gly Val Leu Pro 

TCA GCA GCT ACA GCA GCT GAA TAT ACT TCT AAA TCA ATT GCT GCA GAT 
ser Ala Ala Thr Ala Ala Glu Tyr Thr Ser Lys Ser lie Ala Ala Asp 

TAT ACA TTT GCA ACA GGT GAA GGA TTC ACT TTA AAT ATT GAT AAT GCT 
Tyr Thr Phe Ala Thr Gly Glu Gly Phe Thr Leu. Asn He Asp Asn Ala 

GGT GCT CAA GTA ATT AAC TTA GCA GGT AAA AAA GGT GCA CAA GGT GTA 
Gly Ala Gin Val He Asn Leu Ala Gly Lys Lys Gly Ala Gin Gly Val 

Ala Asp Ala He Asn Ala Thr Phe Ala Gly Thr Ala Thr V.l Ser Gly 

GAC AAA GTA GTT ATT AAA TCA GCT ACA ACA GGT GTT GGT TCT GAA GTT 
Asp Lys val val He Lys Ser Ala Thr Thr Gly Val Gly Ser Glu Val 

GAA GTT ACA TTC TCT TCT GTT AAT CAA GTA TTA AAT GCA GTA GTT AAC 
Glu Val Thr Phe Ser ser Val Asn Gin Val Leu Asn Ala Val Val Asn 

GGT AAA GAT CAA GTC GTT GCA GGA ACA GCT GCT ACA AAA GCA TTC ACG 
Gly Lys Asp Gin Val Val Ala Gly Thr Ala Ala Thr Lys Ala Phe Thr 
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ATT ACT ACA GCC CTT TCT GTG GGT GAA AAA GTA GTT ATT GAT GGT GTT 
lie Thr Thr Ala Leu Ser Val Gly Glu Lys Val Val lie Asp Gly Val 

GAA TAT ACT GCT GTA GCA TTT GGA ACT GCT CCA ACA GCA AAT ACA TTC 
Glu Tyr Thr Ala Val Ala Phe Gly Thr Ala Pro Thr Ala Asn Thr Phe 

GTA GTT GAA TCT GCT GCT AAT ACA TTA GCT TCA GTA GCT GAC CAA GCT 
Val Val. Glu Ser Ala Ala Asn Thr Leu Ala Ser Val Ala Asp Gin Ala 

GCA AAT CTT GCT GCT ACA ATT GAT ACT TTA AAC ACT GCA GAT AAG TTT 
Ala Asn Leu Ala Ala Thr He Asp Thr Leu Asn Thr Ala Asp Lys Phe 

ACA GCT TCT GCA ACA GGT GCT ACT ATT ACA TTA ACT TCT ACT GTA ACA 
Thr Ala Ser Ala Thr Gly Ala Thr He Thr Leu Thr Ser Thr Val Thr 

CCA GTA GGT ACT ACA ATT ACT GAA CCA GTA ATT ACA TTA AAA 
Pro Val Gly Thr Thr He Thr Glu Pro Val He Thr Leu Lys 

T AAG C AAT T A ACTTAAAATA CTTTTAATTA TTTGCCTATT T TAT AAT TTC TATGACTCT 

TGAGA TAACA ATCTCATAGA GTC TTTTTTA T T T T TAG AAC CTCTAGATAG AAAGAAATT 

GAATTTATTA TGAAATTTAT AAAG AAG TCT TGTAACCTTT TATAAGGTAA CTAGTCTA-. 

TAAGAGAGTT AT G T AAAAG C AATATAT AT C GATTCATATT ATT TAAAAGG CTAAAATTA 

TGTTTTAACT CAAACGGGGG T G G T AAC AAA AG T T AAT CAA G C AG C AAT GA GTTTTCTAG. 
AAAT AT T CAT GAAATTCTGG AAATCCTTAT TGCTTTATAT GAAGCTT 
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pSL151 pSL152 pSL153 

GAATTCGCTA AGAAACGCCT TCTATATTTC GGTTTCTTTA CAATTATAAC TAAAAT ATTA 

-35 

p3 
T 

CGGGAGTCTT TAATTTTTGA CA AT T TAG T A AC CAT T C CAG AAAATG CTTG GTT ATTATTG 
-10 -35 

p2 pSL154 

▼ 4- 

AGAGTAAGG T AT AAT A G G T A AC G G AAC TAT ATGTTACCAA TCCAAATGAG GATATAATTA 
-10 

pSL155 
I 

GTTGTAATTT TAATGGTTTC T AC C AAATAC CAT AT TAGG T ATGGTAAAAA AATCTTCTAT 

pSL156 P 1 

I ▼ 

AACTAAATTT AT G T CC CAAT GC TTGAAT TT CGGAAAAGAT AG T G T TAT AT TA TTGTAGAA 

^35~ -10 

pSL157 
I 

AG T G AAT AAA C T T AC T AGAA TGGTATTCTA CTACGCTTTT T C TAG T AAAT T T AC T AAC AA 

pSL158 
I 

ATTTGCTTTA GTTTTGTATT AT T C AAGAAA G C TAT AAT AC ATACATTTAG G T AAC TAG G C 



-35* -10 



pSL159 



GGTACTATAG TTTTCGTTGG AT T AAT AT C A ATTTAAGGAA TTTTAGGGAG GAATACATT.-. 
ATGGCAAAGC AAAACAAAGG CCGTAAGTTC TTCGCGGCAT CAGCAACAGC TGCATTAGTT 



GCATCGGCAA TCGTACCTGT AGCATCTGCT GCACAAGTAA ACGAC TATAA CAAAATCTCT 
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FIG. 16 
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FIG. 18A 



FIG. IBB 
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FIG. 18D 
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FIG. 20 
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FIG. 21C 
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